T3 Talk2Text - A model for near real-time voice transcription in virtual group meetings

Group projects thrive on communication, but how can students revisit discussions effortlessly? We present T3 Talk2Text, an open-source tool for real-time meeting transcription, enabling reflection and collaboration analysis. Discover the tech behind it!
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

SpringerLink
SpringerLink SpringerLink

T3 Talk2Text – A model for near real-time voice transcription in virtual group meetings - Discover Education

Group projects are an important part of many educational programs. In these projects, groupware tools like shared workspaces, shared editors, and synchronous or asynchronous communication tools (video conferencing, chat, email, forum) are often utilized. For synchronous collaboration, video conferencing systems are often used since they allow for direct and effortless informal communication via speech and video. If the group is to reflect about the content and way of communication and collaboration within and across meetings, the group needs access to the development of artifacts as well as the conversation of the meetings. While some video conferencing systems allow recording of a meeting, accessing content and dialogue structure requires real-time viewing and detailed note-taking. To reduce this effort, a video chat should include a transcription functionality that is able to support students in an online group work session by providing a video chat with automatic creation of a transcript. For this purpose, a video communication tool named T3 was developed that enables communication between learners and generates a transcript through an integrated automatic speech recognition (ASR) system. The transcript can be used to reflect on the conversations in order to recall the topics discussed or to identify group work problems, such as insufficient participation, coordination and collaboration problems. The implementation of T3 uses voice activation detection, WebRTC and ASR models to maintain a high level of quality in the transcription process. Initial functionality tests demonstrated the ability of T3 to create accurate group discussion transcripts, making it attractive for students and teachers to assess and improve communication and collaboration, and for researchers studying group discussions.

The Challenge: Capturing Group Discussions for Reflection

Group projects are a cornerstone of modern education, but remote collaboration introduces challenges like unequal participation, miscommunication, and the difficulty of recalling past discussions. While video conferencing tools facilitate real-time interaction, they lack built-in support for documenting conversations. Manual note-taking is cumbersome, and post-meeting transcriptions from recordings are time-consuming.

We asked: How can we provide students and educators with an effortless way to capture and reflect on group discussions in near real-time?

Introducing T3 Talk2Text

t3 - logo

Our solution, T3 Talk2Text, is an open-source web application that integrates:

  • WebRTC-based video conferencing (peer-to-peer, no expensive licenses)
  • Automatic Speech Recognition (ASR) and Real-time transcription via OpenAI’s Whisper model
  • On-demand summaries (using Llama3 for AI-generated insights)

Unlike commercial tools (e.g., Microsoft Teams), T3 prioritizes privacy (self-hosted), multilingual support, and accessibility (works on any device with a browser).

 User interface of T3 - T3 in use                      

Key Innovations

  1. Voice Activity Detection (VAD) Pipeline

    • Filters background noise and segments speech for accurate ASR input.

    • Achieves 8.04% Word Error Rate (WER) in German—comparable to human transcribers.

  2. Dynamic Transcript Formats
    Users can download transcripts as:

    • PDFs (messenger-style, with speaker alignment)

    • CSVs (structured for analysis)

    • AI summaries (condensed key points)

      Possible dialog protocol output formats

  3. Scalable Architecture

    • Lightweight SQLite storage for transcripts.

    • Peer-to-peer media streaming reduces server load.

Behind the Scenes: Overcoming Technical Hurdles

Challenge 1: Real-Time Processing
WebRTC’s low-latency streams were ideal for video chat but required careful buffering to feed Whisper ASR without delays. Our VAD component ensured only speech segments were processed, optimizing resource use.

Challenge 2: Multilingual Support
Whisper’s multilingual capabilities let T3 adapt to diverse classrooms, though future work will explore fine-tuning for non-native accents.

Challenge 3: Privacy-First Design
All data stays on institutional servers, and temporary audio files are deleted post-transcription, which is critical for GDPR compliance.

Impact and Future Directions

Initial tests with student groups showed T3 seamlessly integrated into discussions without disrupting collaboration. Educators highlighted its potential for:

  • Identifying participation gaps (via speaker-labeled transcripts).

  • Conflict resolution (revisiting past dialogue).

  • Research (analyzing communication patterns across courses).

Next steps include:

  • Deploying Whisper Large Turbo for faster, even more accurate transcriptions.

  • Longitudinal studies in university courses to measure learning outcomes.

Try It Out!

T3 is open-source and available for institutions to adapt. We welcome collaborations to explore its use in classrooms and beyond.

Read the full paper: The Paper
Code repository: Github
Contact: Thomas.Kasakowskij@fernuni-hagen.de

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Speech and Audio Processing
Technology and Engineering > Electrical and Electronic Engineering > Signal, Speech and Image Processing > Speech and Audio Processing
eLearning
Humanities and Social Sciences > Education > Media Education > Digital Education and Educational Technology > eLearning
Groupwork and Presentation
Humanities and Social Sciences > Education > Skills > Groupwork and Presentation
Higher Education
Humanities and Social Sciences > Education > Higher Education
User Interfaces and Human Computer Interaction
Mathematics and Computing > Computer Science > Computer and Information Systems Applications > User Interfaces and Human Computer Interaction
Natural Language Processing (NLP)
Mathematics and Computing > Computer Science > Artificial Intelligence > Natural Language Processing (NLP)

Related Collections

With Collections, you can get published faster and increase your visibility.

Empowering Education through AI: Opportunities, Challenges and Risk Governance

Artificial Intelligence (AI) is reshaping the landscape of education as a double-edged sword. On one hand, it holds great promise for empowering teaching, learning, assessment, and educational management by making them more efficient, accurate, adaptive, and responsive. On the other hand, the increasing integration of AI in education elicits significant pedagogical, ethical, social, and policy concerns, such as reduced learner agency, algorithmic control, academic misconduct, and exacerbated educational disparities. Without proper risk governance, the AI technologies intended to empower education may end up disrupting the educational processes and wreaking chaos on educational development. This calls for coordinated efforts from policymakers, researchers, and practitioners worldwide to ensure the legitimate, appropriate, responsible, and ethical use of AI in education.

Against this backdrop, this Collection aims to promote critical inquiry into the intricate and multifaceted features of AI use in education. We particularly welcome interdisciplinary perspectives that explicate how policymakers, researchers, educators, and learners can participate in and collaborate to leverage the transformative power of AI while mitigating the potential risks within different educational scenarios and across diversified cultural contexts. Potential topics include (but are not limited to):

1. Social, ethical, cognitive, emotional, and behavioural terrains of AI use in education

2. Human-AI collaboration in teaching, learning, assessment, and educational management

3. Learner agency, self-regulation, and AI-empowered learning

4. Teacher professional development in AI-empowered education

5. AI literacy for educators and learners

6. Educational equity and academic integrity in the AI era

7. Policy innovation and risk governance for AI use in various educational settings

8. Cross-cultural perspectives on AI use and governance in education

This Collection supports and amplifies research related to SDG 4

Keywords: Artificial Intelligence (AI); Human-AI collaboration; AI-empowered education; policy innovation; risk governance; educational equity; academic integrity

Publishing Model: Open Access

Deadline: Aug 26, 2026

Research Trends in STEAM Education

Science, Technology, Engineering, Arts, and Mathematics (STEAM) education has gained global momentum by integrating innovative curriculum, pedagogy, assessment, and community-driven research approaches to promote sustainable citizenship education. The hallmark of STEAM programs lies in engaging students in inquiry-driven learning, connecting theory with practice, and fostering innovation and creativity in teaching and learning. Over time, STEAM education has evolved to integrate various forms of arts, creativity, and humanities, mirroring the scientific breakthroughs and technological advancements of the 21st century. Creativity is a crucial component of STEAM education, and it is often defined as the ability to generate innovative ideas or products that serve their intended purposes. With the development of STEAM educational approaches, it is crucial to explore emerging trends, future research areas, the impact of STEAM education on student outcomes and career readiness, and the role of community and industry partnerships to promote STEAM education for equity and inclusion.

This Collection, "Research Trends in STEAM Education," aims to look at innovative research and practices, creative teaching methods, and growing trends in incorporating STEAM education into primary to university curricula, pedagogy, assessment and research and innovation by enhancing diversity in STEAM fields, addressing gender and socioeconomic disparities, and ensuring the inclusion of students with special needs in all activities. For this Collection, we invite researchers, educators, and practitioners to submit original contributions—empirical studies, reviews, commentaries, perspectives, and case studies—addressing, but not limited to, the following themes:

1. Arts-based pedagogy in STEAM education

2. Innovative research and practices in STEAM education

3. Curriculum development models and/or frameworks for STEAM education

4. Assessment models and/or frameworks for STEAM education

5. Creative teaching methods in STEAM Education

6. Pedagogical models in STEAM education

7. Techno-educational trends in STEAM education

8. Research methodological trends in STEAM education

Keywords: STEAM education; arts-based pedagogy; innovative research; pedagogical approaches; creativity; innovative research and practices; creative teaching methods; curriculum development models; pedagogical models and assessment models; techno-educational trends

Publishing Model: Open Access

Deadline: Dec 04, 2026