MLC-SLM Challenge Registration Is in Full Swing

LOS ANGELES, CA, UNITED STATES, May 14, 2026 /EINPresswire.com/ -- The 2nd Multilingual Conversational Speech Language Model Challenge (MLC-SLM Challenge 2026) is now attracting active registration!

With the rapid development of large language models (LLMs) and speech language models (Speech LLMs), speech recognition and spoken language understanding are moving toward unified modeling. However, real-world multilingual conversational scenarios still present major challenges, including language diversity, accent variation, speaker turns, complex dialogue structures, and insufficient semantic understanding. Results from the first MLC-SLM Challenge showed that Speech LLMs have achieved strong performance in speech recognition, while there remains significant room for further exploration in speaker diarization and deeper speech understanding for complex multilingual conversations. Building on this, the 2nd MLC-SLM Challenge aims to further advance Speech LLMs in speaker diarization, acoustic understanding, and semantic understanding.

The training set for this year's challenge has been further expanded from the first edition, adding more language variants and accents such as Canadian French, Mexican Spanish, and Brazilian Portuguese. The training data totals approximately 2,100 hours and covers around 14 languages, providing richer and more realistic data support for research on multilingual conversational speech language models.

๐— ๐—ฎ๐—ท๐—ผ๐—ฟ ๐˜‚๐—ฝ๐—ฑ๐—ฎ๐˜๐—ฒ: ๐˜๐—ต๐—ฒ ๐—ผ๐—ณ๐—ณ๐—ถ๐—ฐ๐—ถ๐—ฎ๐—น ๐—ฏ๐—ฎ๐˜€๐—ฒ๐—น๐—ถ๐—ป๐—ฒ ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ๐˜€ ๐—ณ๐—ผ๐—ฟ ๐˜๐—ต๐—ถ๐˜€ ๐˜†๐—ฒ๐—ฎ๐—ฟโ€™๐˜€ ๐—ฐ๐—ต๐—ฎ๐—น๐—น๐—ฒ๐—ป๐—ด๐—ฒ ๐—ต๐—ฎ๐˜ƒ๐—ฒ ๐—ป๐—ผ๐˜„ ๐—ฏ๐—ฒ๐—ฒ๐—ป ๐—ฟ๐—ฒ๐—น๐—ฒ๐—ฎ๐˜€๐—ฒ๐—ฑ!

Task 1 focuses on multilingual conversational speech speaker diarization and recognition. The baseline system is built on Microsoft's open-source VibeVoice-ASR model and fine-tuned with the challenge training set.

Task 2 focuses on multilingual conversational speech understanding. The baseline system uses Gemini 2.5 Pro to construct multiple-choice questions for acoustic and semantic understanding, and is fine-tuned based on Qwen2.5-Omni-7B and the ms-swift toolkit.

Participating teams can now refer to the official baseline systems to accelerate system development, experimental validation, and model optimization.

Teams from both academia and industry are continuing to join the challenge. Notably, employees from NVIDIA and JPMorgan Chase have already formed teams to participate, reflecting strong interest from leading global technology and financial institutions in multilingual speech language model technologies.

Whether you work on speech recognition, speaker diarization, speech understanding, multimodal large models, or multilingual data and evaluation, MLC-SLM offers a platform to compete and collaborate with researchers, engineers, and industry teams from around the world.

We welcome universities, research institutions, enterprise teams, and individual researchers to register and participate. Join us in advancing the development of multilingual conversational speech language models!

Registration is ongoing. We look forward to your participation.
Official Website Link: https://www.nexdata.ai/competition/mlc-slm
Registration Link: https://forms.gle/jfAZ95abGy4ZiNHo7

Nexdata
MLC-SLM Competition Committee
mlc-slmw@nexdata.ai

