Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model
ASLP-lab
ASLP-lab
AI & ML interests
None yet
Recent Activity
updated a dataset about 2 hours ago
ASLP-lab/SongFormDB updated a dataset about 16 hours ago
ASLP-lab/SongFormBench updated a model about 16 hours ago
ASLP-lab/SongFormerOrganizations
None yet
spaces 8
Configuration error
Agents
9
YingMusic-Singer-Plus
🎤
Edit lyrics, keep the melody
Runtime error
Agents
12
WenetSpeech Yue
🔥
Large-Scale Cantonese Speech Corpus
Runtime error
Agents
1
VoiceSculptor
📚
Running on Zero
Agents
44
DiffRhythm2
🎵
Generate a full song from lyrics and style prompts
Configuration error
Agents
22
SongFormer
🎵
State-of-the-art music analysis with multi-scale datasets
Running on Zero
Agents
Featured
687
Di♪♪Rhythm
🎶
Blazingly Fast and Embarrassingly Simple Song Generation
models 35
ASLP-lab/SongFormer
0.7B • Updated • 353 • 17
ASLP-lab/FM-Speech
Audio Classification • Updated
ASLP-lab/Speaker-Reasoner
32B • Updated • 70 • 1
ASLP-lab/Speaker-Reasoner-4194h
32B • Updated • 76
ASLP-lab/YingMusic-Singer-Plus
Updated • 1.83k • 7
ASLP-lab/OmniCodec
Feature Extraction • Updated • 1
ASLP-lab/OSUM-Pangu
Audio-to-Audio • Updated • 2
ASLP-lab/VoiceSculptor-VD
Text-to-Speech • 4B • Updated • 25 • 18
ASLP-lab/WenetSpeech-Wu-Speech-Understanding
Updated
ASLP-lab/WenetSpeech-Wu-Speech-Generation
Text-to-Speech • Updated • 2
datasets 19
ASLP-lab/SongFormDB
Updated • 4.44k • 6
ASLP-lab/SongFormBench
Viewer • Updated • 3.82k • 554 • 2
ASLP-lab/FMSU-Bench
Updated • 14
ASLP-lab/HumDial-FDBench
Updated • 198 • 2
ASLP-lab/FastTurn-Testset
Updated • 55
ASLP-lab/WSC-Train
Preview • Updated • 464 • 120
ASLP-lab/LyricEditBench
Viewer • Updated • 7.2k • 286 • 2
ASLP-lab/WenetSpeech-Wu-Bench
Viewer • Updated • 242 • 390 • 4
ASLP-lab/WenetSpeech-Wu
Updated • 31 • 1
ASLP-lab/WenetSpeech-Yue
Updated • 433 • 41