Automatische Transkription von englischen und deutschen qualitativen Interviews

Autor/innen

  • Susanne Wollin-Giering Technische Universität Berlin
  • Markus Hoffmann Technische Universität Berlin
  • Jonas Höfting Technische Universität Berlin
  • Carla Ventzke Senatsverwaltung für Inneres und Sport

DOI:

https://doi.org/10.17169/fqs-25.1.4129

Schlagworte:

qualitative Interviews, Transkription, Methoden, Automatisierung, Datenschutz, Genauigkeit, automatische Transkription

Abstract

Die Aufnahme und Transkription von Interviews sind zentrale, aber ressourcen- und zeitintensive Schritte in der qualitativen Sozialforschung. In Bezug darauf haben Forscher*innen verschiedene Alternativen vorgeschlagen, unter anderem automatische Transkription mithilfe von Spracherkennungsalgorithmen. Ob solche automatisch erstellten Transkripte die Qualität von durch Menschen erstellte Transkripte erreichen, ist noch unklar. In diesem Beitrag vergleichen wir systematisch mehrere automatische Transkriptionswerkzeuge: Amberscript, Dragon, F4x, Happy Scribe, NVivo, Sonix, Trint, Otter und Whisper. Wir bewerten Aspekte des Datenschutzes, der Genauigkeit, der nötigen Zeit und der Kosten anhand eines englischsprachigen und eines deutschsprachigen Interviews. Unsere Analyse ergibt, dass Whisper insgesamt am besten abschneidet und dass ähnliche lokal-automatische Transkriptionswerkzeuge in der Zukunft wahrscheinlich relevanter werden. Um höchstmögliche Genauigkeit zu erreichen, empfehlen wir ein Durchlesen des Texts für jegliche Art der Transkription. Wir stellen einen Rahmen zum Vergleich von Transkriptionswerkzeugen bereit und hoffen, einen Beitrag zur Diskussion der Brauchbarkeit von automatischer Transkription leisten zu können.

Downloads

Keine Nutzungsdaten vorhanden.

Autor/innen-Biografien

Susanne Wollin-Giering, Technische Universität Berlin

Susanne WOLLIN-GIERING is currently working at Technical University Berlin in the projects "FUFAF – Functions and Consequences of Unemployment in Researchers Careers" and "EPAC – Effects of Pandemic-related Disruptions on Academic Careers". In her work she focuses on the intersection between the sociology of work and the sociology of science, particularly in the area of academic careers.

Markus Hoffmann, Technische Universität Berlin

Markus HOFFMANN is a research associate and PhD candidate at the Technical University Berlin in the research group for social studies of science and technology. He is currently working on two projects to investigate the field-specific effects and ways of handling unemployment and restrictions induced by the COVID-19 pandemic respectively. His research interests include field-comparative science studies and qualitative approaches of analysing content.

Jonas Höfting, Technische Universität Berlin

Jonas HÖFTING is currently a Master's student in Information Science at Humboldt University Berlin. He supports the projects "FUFAF – Functions and Consequences of Unemployment in Researchers Careers" and "EPAC – Effects of Pandemic-related Disruptions on Academic Careers" as a student assistant at Technical University Berlin.

Carla Ventzke, Senatsverwaltung für Inneres und Sport

Carla VENTZKE supported the projects "FUFAF – Functions and Consequences of Unemployment in Researchers Careers" and "EPAC – Effects of Pandemic-related Disruptions on Academic Careers" as a student assistant at Technical University Berlin.

Literaturhinweise

Ayaß, Ruth (2015). Doing data: The status of transcripts in conversation analysis. Discourse Studies, 17(5), 505-528.

Bokhove, Christian & Downey, Christopher (2018). Automated generation of "good enough" transcripts as a first step to transcription of audio-recorded data. Methodological Innovations, 11(2), https://doi.org/10.1177/2059799118790743 [Accessed: July 17, 2023].

Brinkmann, Svend (2014). Unstructured and semi-structured interviewing. In Patricia Leavy (Ed.), The Oxford handbook of qualitative research (pp.277-299). Oxford: Oxford University Press.

Bucholtz, Mary (2007). Variation in transcription. Discourse Studies, 9(6), 784-808.

Collins, Harry; Leonard-Clarke, Willow & O'Mahoney, Hannah (2019). "Um, Er", How meaning varies between speech and its typed transcript. Qualitative Research, 19(6), 653-668.

Da Silva, Joseph (2021). Producing "good enough" automated transcripts securely: Extending Bokhove and Downey (2018) to address security concerns. Methodological Innovations, 14(1), https://doi.org/10.1177/2059799120987766 [Accessed: July 17, 2023].

Davidson, Christina (2009). Transcription: Imperatives for qualitative research. International Journal of Qualitative Methods, 8(2), 35-52, https://doi.org/10.1177/160940690900800206 [Accessed: July 17, 2023].

Easton, Lee; Lexier, Roberta; Lindstrom, Gabrielle & Yeo, Michelle (2019). Uncovering the complicit: The disrupting interview as a decolonising practice. In Lynn Quinn (Ed.), Re-imagining curriculum: Spaces for disruption (pp.149-169). Stellenbosch: African Sun Media.

Edwards, Rosalind & Holland, Janet (2013). What is qualitative interviewing? London: Bloomsbury.

Evers, Jeanine C. (2011). From the past into the future. How technological developments change our ways of data collection, transcription and analysis. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 12(1), Art. 38, https://doi.org/10.17169/fqs-12.1.1636 [Accessed: July 17, 2023].

Favre, Benoit; Cheung, Kyla; Kazemian, Siavash; Lee, Adam; Liu, Yang; Munteanu, Cosmin, Nenkova, Ani; Ochei, Dennis; Penn, Gerald; Tratz, Stephen; Voss, Clare & Zeller, Frauke (2013). Automatic human utility evaluation of ASR systems: Does WER really predict performance? Proceedings Interspeech, 3463-3467.

Flick, Uwe (2009). An introduction to qualitative research (4th ed.). Los Angeles, CA: Sage.

Fuß, Susanne & Karbach, Ute (2019). Grundlagen der Transkription, Eine praktische Einführung (2nd ed.). Opladen: Barbara Budrich.

Gläser, Jochen & Laudel, Grit (2010). Experteninterviews und qualitative Inhaltsanalyse als Instrumente rekonstruierender Untersuchungen (4th ed.). Wiesbaden: VS Verlag für Sozialwissenschaften.

Gläser, Jochen & Laudel, Grit (2019). The discovery of causal mechanisms: Extractive qualitative content analysis as a tool for process tracing. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 20(3), Art. 29, https://doi.org/10.17169/fqs-20.3.3386 [Accessed: November 26, 2023].

Halcomb, Elizabeth J. & Davidson, Patricia M (2006). Is verbatim transcription of interview data always necessary? Applied Nursing Research, 19(1), 38-42.

Hammersley, Martyn (2010). Reproducing or constructing? Some questions about transcription in social research. Qualitative Research, 10(5), 553-569.

Heselwood, Barry (2013). Phonetic transcription in theory and practice. Edinburgh: Edinburgh University Press.

Jefferson, Gail (1985a). An exercise in the transcription and analysis of laughter. In Teun A. van Dijk (Ed.), Handbook of discourse analysis. Volume 3: Discourse and dialog (pp.25-34). London: Academic Press.

Jefferson, Gail (1985b). On the interactional unpackaging of a "gloss". Language in Society, 14(4), 435-466.

Jefferson, Gail (2004). Glossary of transcript symbols with an introduction. In Gene H. Lerner (Ed.), Conversation analysis: Studies from the first generation (pp.13-31). Amsterdam: John Benjamins Publishing Company.

Korobov, Neill (2001). Reconciling theory with method: From conversation analysis and critical discourse analysis to positioning analysis. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 2(3), Art. 11, https://doi.org/10.17169/fqs-2.3.906 [Accessed: July 17, 2023].

Kowal, Sabine & O'Connell, Daniel C. (2014). Transcription as a crucial step of data analysis. In Uwe Flick (Ed.), The Sage handbook of qualitative data analysis (pp.64-78). London: Sage.

Kreuz, Roger J. & Riordan, Monica A. (2011). The transcription of face-to-face interaction. In Wolfram Bublitz & Neal R. Norrick (Eds.), Foundations of pragmatics (pp.657-679). Berlin: De Gryuter.

Kvale, Steinar & Brinkmann, Svend (2009). InterViews: Learning the craft of qualitative research interviewing (2nd ed.). Los Angeles, CA: Sage.

Lapadat, Judith C. & Lindsay, Anne C. (1999). Transcription in research and practice: From standardization of technique to interpretive positionings. Qualitative Inquiry, 5(1), 64-86.

Laudel, Grit & Bielick, Jana (2019). Forschungspraktische Probleme bei der Archivierung von leitfadengestützten Interviews. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 20(2), Art. 10, https://doi.org/10.17169/fqs-20.2.3077 [Accessed: November 26, 2023].

Liyanagunawardena, Tharindu (2019). Automatic transcription software: Good enough for accessibility? A case study from built environment education. EDEN Conference Proceedings, Volume(1), 388-396.

Loubere, Nicholas (2017). Questioning transcription: The case for the systematic and reflexive interviewing and reporting (SRIR) method. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 18(2), Art. 15, https://doi.org/10.17169/fqs-18.2.2739 [Accessed: July 17, 2023].

MacLean, Lynne M.; Meyer, Mechthild & Estable, Alma (2004). Improving accuracy of transcripts in qualitative research. Qualitative Health Research, 14(1), 113-23.

Marge, Matthew; Banerjee, Satanjeev & Rudnicky, Alexander I. (2010). Using the Amazon mechanical turk for transcription of spoken language. In Institute of Electrical and Electronics Engineers (IEEE) (Eds.), 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010) (pp.5270-5273). Red Hook, NY: Curran Associates, Inc., https://doi.org/10.1109/ICASSP.2010.5494979 [Accessed: November 26, 2023].

Markle, D. Thomas; West, Richard Edward & Rich, Peter J. (2011). Beyond transcription: Technology, change, and refinement of method. Forum Qualitative Sozialforschung / Forum: Qualitative Social Research, 12(3), Art. 21, https://doi.org/10.17169/fqs-12.3.1564 [Accessed: October 12, 2023].

Mishler, Elliot G. (2003). Representing discourse: The rhetoric of transcription. In Yvonna S. Lincoln & Norman K. Denzin (Eds.), Turning points in qualitative research: Tying knots in a handkerchief (pp.297-326). Lanham: AltaMira Press.

Moore, Robert J. (2015). Automated transcription and conversation analysis. Research on Language and Social Interaction, 48(3), 253-270.

Ochs, Elinor (1979). Transcription as theory. In Elinor Ochs & Bambi B. Schieffelin (Eds.), Developmental pragmatics (pp.43-73). New York, NY: Academic Press.

Oliver, Daniel G.; Serovich, Julianne M. & Mason, Tina L. (2005). Constraints and opportunities with interview transcription: Towards reflection in qualitative research. Social Forces, 84(2), 1273-1289.

Park, Julia & Zeanah, A. Echo (2005). An evaluation of voice recognition software for use in interview-based research: A research note. Qualitative Research, 5(2), 245-251.

Paulus, Trena; Lester, Jessica & Dempster, Paul (2014). Digital tools for qualitative research. London: Sage.

Psathas, George & Anderson, Timothy (1990). The "practices" of transcription in conversation analysis. Semiotica, 78(1-2), 75-99.

Reichertz, Jo (2016). Qualitative und interpretative Sozialforschung: Eine Einladung. Wiesbaden: Springer VS.

Rutakumwa, Rwamahe; Mugisha, Joseph Okello; Bernays, Sarah; Kabunga, Elizabeth; Tumwekwase, Grace; Mbonye, Martin & Seeley, Janet (2020). Conducting in-depth interviews with and without voice recorders: A comparative analysis. Qualitative Research, 20(5), 565-581. https://doi.org/10.1177/1468794119884806 [Accessed: July 17, 2023].

Sacks, Harvey; Schegloff, Emanuel A. & Jefferson, Gail (1974). A simplest systematics for the organization of turn-taking for conversation. Language, 50(4), 696-735.

Tilley, Susan A. & Powick, Kelly D. (2002). Distanced data: Transcribing other people's research tapes. Canadian Journal of Education / Revue canadienne de l'éducation, 27(2/3), 291-310.

von Neumann, Thilo; Boeddeker, Christoph; Kinoshita, Keisuke; Delcroix, Marc & Haeb-Umbach, Reinhold (2022). On word error rate definitions and their efficient computation for multi-speaker speech recognition systems. arXiv, https://doi.org/10.48550/arXiv.2211.16112 [Accessed: July 17, 2023].

Veröffentlicht

2024-01-29

Zitationsvorschlag

Wollin-Giering, S., Hoffmann, M., Höfting, J., & Ventzke, C. (2024). Automatische Transkription von englischen und deutschen qualitativen Interviews. Forum Qualitative Sozialforschung Forum: Qualitative Social Research, 25(1). https://doi.org/10.17169/fqs-25.1.4129

Ausgabe

Rubrik

Einzelbeiträge