LLMYourWay

kwilk90


DSpAST

DSpAST: Disentangled Spatial Audio Spectrogram Transformer

Checkpoints for "DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models".

On our system, the provided checkpoints achieve the following performance (↑: higher is better, ↓: lower is better):

| Binaural Encoder | mAP (↑) | ER20° (↓) | MAE (↓) | DER (↓) |
| :---: | :---: | :---: | :---: | :---: |
| SpatialAST | 49.90 | 24.43 | 17.87 | 32.50 |
| DSpAST (stage 1) | 53.05 | 98.56 | 95.57 | 97.58 |
| DSpAST (stage 2) | 52.64 | 20.31 | 14.44 | 28.35 |
| DSpAST (stage 3) | 54.53 | 20.28 | 14.44 | 28.03 |

Similar performance improvements can also be observed when using DSpAST as a binaural encoder for spatial audio reasoning with LLMs. Please see our paper for further information.

If you use the checkpoints in your work, we kindly ask you to cite the DSpAST paper and the original BAT paper, which is the foundation of this work.
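To make the differences relative to SpatialAST explicit, the reported numbers can be compared metric by metric. The following is a minimal sketch, not part of the released checkpoints or code: the values are copied from the results reported above, while the names `RESULTS`, `METRICS`, `HIGHER_IS_BETTER`, and `deltas` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the reported results: (mAP, ER20°, MAE, DER).
RESULTS = {
    "SpatialAST": (49.90, 24.43, 17.87, 32.50),
    "DSpAST (stage 1)": (53.05, 98.56, 95.57, 97.58),
    "DSpAST (stage 2)": (52.64, 20.31, 14.44, 28.35),
    "DSpAST (stage 3)": (54.53, 20.28, 14.44, 28.03),
}
METRICS = ("mAP", "ER20", "MAE", "DER")
# Only mAP is marked with an up arrow; the other metrics are errors (lower is better).
HIGHER_IS_BETTER = {"mAP"}


def deltas(model, baseline="SpatialAST"):
    """Map each metric to (absolute change vs. baseline, whether it is an improvement)."""
    out = {}
    for name, new, old in zip(METRICS, RESULTS[model], RESULTS[baseline]):
        diff = round(new - old, 2)
        improved = diff > 0 if name in HIGHER_IS_BETTER else diff < 0
        out[name] = (diff, improved)
    return out


if __name__ == "__main__":
    for model in RESULTS:
        if model != "SpatialAST":
            print(model, deltas(model))
```

Under these numbers, the stage 2 and stage 3 checkpoints improve on SpatialAST in all four metrics, whereas the stage 1 checkpoint improves only mAP.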

License: CC BY-NC 4.0