  1. <!DOCTYPE html>
  2. <html class="writer-html5" lang="en" >
  3. <head>
  4. <meta charset="utf-8" />
  5. <meta name="viewport" content="width=device-width, initial-scale=1.0" />
  6. <title>Introduction &mdash; MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 documentation</title><link rel="stylesheet" href="_static/css/theme.css" type="text/css" />
  7. <link rel="stylesheet" href="_static/pygments.css" type="text/css" />
  8. <!--[if lt IE 9]>
  9. <script src="_static/js/html5shiv.min.js"></script>
  10. <![endif]-->
  11. <script id="documentation_options" data-url_root="./" src="_static/documentation_options.js"></script>
  12. <script src="_static/jquery.js"></script>
  13. <script src="_static/underscore.js"></script>
  14. <script src="_static/doctools.js"></script>
  15. <script src="_static/language_data.js"></script>
  16. <script crossorigin="anonymous" integrity="sha256-Ae2Vz/4ePdIu6ZyI/5ZGsYnb+m0JlOmKPjt6XZ9JJkA=" src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.4/require.min.js"></script>
  17. <script async="async" src="https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.5/latest.js?config=TeX-AMS-MML_HTMLorMML"></script>
  18. <script type="text/x-mathjax-config">MathJax.Hub.Config({"tex2jax": {"inlineMath": [["$", "$"], ["\\(", "\\)"]], "processEscapes": true, "ignoreClass": "tex2jax_ignore|mathjax_ignore|document", "processClass": "tex2jax_process|mathjax_process|math|output_area"}})</script>
  19. <script src="_static/js/theme.js"></script>
  20. <link rel="index" title="Index" href="genindex.html" />
  21. <link rel="search" title="Search" href="search.html" />
  22. <link rel="next" title="Datasets" href="Dataset.html" />
  23. <link rel="prev" title="ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)" href="index.html" />
  24. </head>
  25. <body class="wy-body-for-nav">
  26. <div class="wy-grid-for-nav">
  27. <nav data-toggle="wy-nav-shift" class="wy-nav-side">
  28. <div class="wy-side-scroll">
  29. <div class="wy-side-nav-search" >
  30. <a href="index.html" class="icon icon-home">
  31. MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0
  32. </a>
  33. <div role="search">
  34. <form id="rtd-search-form" class="wy-form" action="search.html" method="get">
  35. <input type="text" name="q" placeholder="Search docs" aria-label="Search docs" />
  36. <input type="hidden" name="check_keywords" value="yes" />
  37. <input type="hidden" name="area" value="default" />
  38. </form>
  39. </div>
  40. </div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
  41. <p class="caption"><span class="caption-text">Contents:</span></p>
  42. <ul class="current">
  43. <li class="toctree-l1 current"><a class="current reference internal" href="#">Introduction</a><ul>
  44. <li class="toctree-l2"><a class="reference internal" href="#call-for-participation">Call for participation</a></li>
  45. <li class="toctree-l2"><a class="reference internal" href="#timeline-aoe-time">Timeline (AoE Time)</a></li>
  46. <li class="toctree-l2"><a class="reference internal" href="#guidelines">Guidelines</a></li>
  47. </ul>
  48. </li>
  49. <li class="toctree-l1"><a class="reference internal" href="Dataset.html">Datasets</a></li>
  50. <li class="toctree-l1"><a class="reference internal" href="Track_setting_and_evaluation.html">Track &amp; Evaluation</a></li>
  51. <li class="toctree-l1"><a class="reference internal" href="Baseline.html">Baseline</a></li>
  52. <li class="toctree-l1"><a class="reference internal" href="Rules.html">Rules</a></li>
  53. <li class="toctree-l1"><a class="reference internal" href="Challenge_result.html">Challenge Result</a></li>
  54. <li class="toctree-l1"><a class="reference internal" href="Organizers.html">Organizers</a></li>
  55. <li class="toctree-l1"><a class="reference internal" href="Contact.html">Contact</a></li>
  56. </ul>
  57. </div>
  58. </div>
  59. </nav>
  60. <section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
  61. <i data-toggle="wy-nav-top" class="fa fa-bars"></i>
  62. <a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a>
  63. </nav>
  64. <div class="wy-nav-content">
  65. <div class="rst-content">
  66. <div role="navigation" aria-label="Page navigation">
  67. <ul class="wy-breadcrumbs">
  68. <li><a href="index.html" class="icon icon-home" aria-label="Home"></a></li>
  69. <li class="breadcrumb-item active">Introduction</li>
  70. <li class="wy-breadcrumbs-aside">
  71. <a href="_sources/Introduction.md.txt" rel="nofollow"> View page source</a>
  72. </li>
  73. </ul>
  74. <hr/>
  75. </div>
  76. <div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
  77. <div itemprop="articleBody">
  78. <div class="section" id="introduction">
  79. <h1>Introduction<a class="headerlink" href="#introduction" title="Permalink to this headline"></a></h1>
  80. <div class="section" id="call-for-participation">
  81. <h2>Call for participation<a class="headerlink" href="#call-for-participation" title="Permalink to this headline"></a></h2>
  82. <p>Automatic speech recognition (ASR) and speaker diarization have made significant strides in recent years, resulting in a surge of speech technology applications across various domains. However, meetings present unique challenges to speech technologies due to their complex acoustic conditions and diverse speaking styles, including overlapping speech, variable numbers of speakers, far-field signals in large conference rooms, and environmental noise and reverberation.</p>
  83. <p>Over the years, several challenges have been organized to advance meeting transcription, including the Rich Transcription evaluation and the Computational Hearing in Multisource Environments (CHiME) challenges. The latest iteration of the CHiME challenge focuses on distant automatic speech recognition and on developing systems that generalize across array topologies and application scenarios. However, while progress has been made in English meeting transcription, language differences remain a significant barrier to achieving comparable results in other languages, such as Mandarin. The Multimodal Information Based Speech Processing (MISP) and Multi-Channel Multi-Party Meeting Transcription (M2MeT) challenges have been instrumental in advancing Mandarin meeting transcription. The MISP challenge addresses audio-visual distant multi-microphone signal processing in everyday home environments, while the M2MeT challenge tackles speech overlap in offline meeting rooms.</p>
  84. <p>The ICASSP 2022 M2MeT challenge focuses on meeting scenarios and comprises two main tasks: speaker diarization and multi-speaker automatic speech recognition. The former identifies who spoke when in the meeting, while the latter transcribes speech from multiple speakers simultaneously, which is technically difficult because of overlapping speech and acoustic interference.</p>
  85. <p>Building on the success of the previous M2MeT challenge, we propose the M2MeT2.0 challenge as an ASRU 2023 challenge special session. In the original M2MeT challenge, the evaluation metric was speaker-independent, so a system could be scored on what was said but not on who said it. To address this limitation and move multi-talker ASR systems closer to practical use, the M2MeT2.0 challenge proposes the speaker-attributed ASR task with two sub-tracks: fixed and open training conditions. The speaker-attributed ASR task tackles the practical and challenging problem of identifying “who spoke what and when”. To facilitate reproducible research in this field, we offer a comprehensive overview of the dataset, rules, evaluation metrics, and baseline systems. Furthermore, we will release a carefully curated test set of approximately 10 hours of audio, according to the timeline. The new test set is designed to let researchers validate and compare their models’ performance and advance the state of the art in this area.</p>
  86. </div>
  87. <div class="section" id="timeline-aoe-time">
  88. <h2>Timeline (AoE Time)<a class="headerlink" href="#timeline-aoe-time" title="Permalink to this headline"></a></h2>
  89. <ul class="simple">
  90. <li><p><span class="math notranslate nohighlight">\( April~29, 2023: \)</span> Challenge and registration open.</p></li>
  91. <li><p><span class="math notranslate nohighlight">\( May~11, 2023: \)</span> Baseline release.</p></li>
  92. <li><p><span class="math notranslate nohighlight">\( May~22, 2023: \)</span> Registration deadline (the last day for participants to join the challenge).</p></li>
  93. <li><p><span class="math notranslate nohighlight">\( June~16, 2023: \)</span> Test data release and leaderboard open.</p></li>
  94. <li><p><span class="math notranslate nohighlight">\( June~20, 2023: \)</span> Final submission deadline and leaderboard close.</p></li>
  95. <li><p><span class="math notranslate nohighlight">\( June~26, 2023: \)</span> Evaluation result and ranking release.</p></li>
  96. <li><p><span class="math notranslate nohighlight">\( July~3, 2023: \)</span> Deadline for paper submission.</p></li>
  97. <li><p><span class="math notranslate nohighlight">\( July~10, 2023: \)</span> Deadline for final paper submission.</p></li>
  98. <li><p><span class="math notranslate nohighlight">\( December~12\ to\ 16, 2023: \)</span> ASRU Workshop and Challenge Session.</p></li>
  99. </ul>
  100. </div>
  101. <div class="section" id="guidelines">
  102. <h2>Guidelines<a class="headerlink" href="#guidelines" title="Permalink to this headline"></a></h2>
  103. <p>Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023.</p>
  104. <p><a class="reference external" href="https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link">M2MeT2.0 Registration</a></p>
  105. <p>Within three working days, the challenge organizer will send email invitations to eligible teams to participate in the challenge. All qualified teams are required to adhere to the challenge rules, which will be published on the challenge page. Before the ranking release, each participant must submit a system description document detailing their approach and methods. The organizer will select the top-ranking submissions to be included in the ASRU 2023 Proceedings.</p>
  106. </div>
  107. </div>
  108. </div>
  109. </div>
  110. <footer><div class="rst-footer-buttons" role="navigation" aria-label="Footer">
  111. <a href="index.html" class="btn btn-neutral float-left" title="ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)" accesskey="p" rel="prev"><span class="fa fa-arrow-circle-left" aria-hidden="true"></span> Previous</a>
  112. <a href="Dataset.html" class="btn btn-neutral float-right" title="Datasets" accesskey="n" rel="next">Next <span class="fa fa-arrow-circle-right" aria-hidden="true"></span></a>
  113. </div>
  114. <hr/>
  115. <div role="contentinfo">
  116. <p>&#169; Copyright 2023, Speech Lab, Alibaba Group; ASLP Group, Northwestern Polytechnical University.</p>
  117. </div>
  118. Built with <a href="https://www.sphinx-doc.org/">Sphinx</a> using a
  119. <a href="https://github.com/readthedocs/sphinx_rtd_theme">theme</a>
  120. provided by <a href="https://readthedocs.org">Read the Docs</a>.
  121. </footer>
  122. </div>
  123. </div>
  124. </section>
  125. </div>
  126. <script>
  127. jQuery(function () {
  128. SphinxRtdTheme.Navigation.enable(true);
  129. });
  130. </script>
  131. </body>
  132. </html>