Enhancing Usability of Voice Interfaces for Socially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (2024)

Enhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (2)

Advanced Search

Browse

Article

Free Access

  • Authors:
  • Oliver Guhr https://ror.org/05q5pk319HTW Dresden, Friedrich-List-Platz 1, 01069, Dresden, Germany https://ror.org/042aqky30Technische Universität Dresden, 01062, Dresden, Germany

    https://ror.org/05q5pk319HTW Dresden, Friedrich-List-Platz 1, 01069, Dresden, Germany

    https://ror.org/042aqky30Technische Universität Dresden, 01062, Dresden, Germany

    Search about this author

    ,
  • Claudia Loitsch https://ror.org/042aqky30Technische Universität Dresden, 01062, Dresden, Germany Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Dresden, Germany

    https://ror.org/042aqky30Technische Universität Dresden, 01062, Dresden, Germany

    Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Dresden, Germany

    Search about this author

    ,
  • Gerhard Weber https://ror.org/042aqky30Technische Universität Dresden, 01062, Dresden, Germany

    https://ror.org/042aqky30Technische Universität Dresden, 01062, Dresden, Germany

    Search about this author

    ,
  • Hans-Joachim Böhme https://ror.org/05q5pk319HTW Dresden, Friedrich-List-Platz 1, 01069, Dresden, Germany

    https://ror.org/05q5pk319HTW Dresden, Friedrich-List-Platz 1, 01069, Dresden, Germany

    Search about this author

Artificial Intelligence in HCI: 5th International Conference, AI-HCI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Washington, DC, USA, June 29–July 4, 2024, Proceedings, Part IIIJun 2024Pages 231–249https://doi.org/10.1007/978-3-031-60615-1_15

Published:29 June 2024Publication History

  • 0citation
  • 0
  • Downloads

Metrics

Total Citations0Total Downloads0

Last 12 Months0

Last 6 weeks0

  • Get Citation Alerts

    New Citation Alert added!

    This alert has been successfully added and will be sent to:

    You will be notified whenever a record that you have chosen has been cited.

    To manage your alert preferences, click on the button below.

    Manage my Alerts

    New Citation Alert!

    Please log in to your account

  • Publisher Site

Artificial Intelligence in HCI: 5th International Conference, AI-HCI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Washington, DC, USA, June 29–July 4, 2024, Proceedings, Part III

Enhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study

Pages 231–249

PreviousChapterNextChapter

Enhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (3)

Abstract

Voice Interfaces have become ubiquitous as they can make complex technology more usable and accessible. Current voice interfaces, however, often require the user to learn specific speech commands or sentence patterns to use them. This property does not satisfy usability heuristics and causes current language interfaces to underachieve the naturalness of language interaction. To address this issue, we developed a voice interface that is capable of understanding natural everyday language. The overall objective is to build a German language voice interface for socially assistive robots that can work in public spaces. Therefore, we cannot assume the user’s prior knowledge or experience. Based on recent advances in deep natural language processing, we have built a voice interface that is not restricted to specific speech commands. To test this voice interface, we conducted a study with 47 participants. Results indicate 93% of the given tasks were solved successfully by the target user group without prior training or experience with the voice interface.

References

  1. 1.Agarwal, A., Zesch, T.: German end-to-end speech recognition based on DeepSpeech (2019). https://www.semanticscholar.org/paper/German-End-to-end-Speech-Recognition-based-on-Agarwal-Zesch/06d32982297d1f46909a26c794941b1e05af1f1cGoogle ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (4)
  2. 2.Ardila, R., et al.: Common voice: a massively-multilingual speech corpus. arXiv:1912.06670 [cs], March 2020Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (5)
  3. 3.Babu, A., et al.: XLS-R: Self-supervised cross-lingual speech representation learning at scale. arXiv:2111.09296 [cs, eess], December 2021. DOI: https://doi.org/10.48550/arXiv.2111.09296Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (6)Cross Ref
  4. 4.Baevski, A., et al.: wav2vec 2.0: a framework for self-supervised learning of speech representations. arXiv:2006.11477 [cs, eess], October 2020Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (8)
  5. 5.Bastianelli, E., et al.: Speaky for robots: the development of vocal interfaces for robotic applications. Appl. Intell. 44(1), 43–66 (2016). ISSN: 0924-669X, 1573-7497. DOI: https://doi.org/10.1007/s10489-015-0695-5. http://link.springer.com/10.1007/s10489-015-0695-5Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (9)Digital Library
  6. 6.Driess, D., et al. PaLM-E: an embodied multimodal language model. arXiv:2303.03378 [cs], March 2023. DOI: https://doi.org/10.48550/arXiv.2303.03378Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (11)Cross Ref
  7. 7.Flake, R., et al.: IW-Trends 3/2018 Fachkräfteengpass in der Altenpflege. German. Technical report, 45. IW Köln, March 2018. https://www.iwkoeln.de/fileadmin/user_upload/Studien/IW-Trends/PDF/2018/IW-Trends_2018-03-02_Pflegefallzahlen.pdfGoogle ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (13)
  8. 8.Hedderich, J., Sachs, L.: Angewandte Statistik. Springer, Heidelberg (2016). DOI: https://doi.org/10.1007/978-3-662-45691-0. http://link.springer.com/10.1007/978-3-662-45691-0. ISBN: 978-3-662-45690-3 978-3-662-45691-0Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (14)Cross Ref
  9. 9.Hidalgo-Paniagua, A., Millan-Alcaide, A., Bandera, J.P., Bandera, A.: Integration of the Alexa assistant as a voice interface for robotics platforms. In: Silva, M.F., Luís Lima, J., Reis, L.P., Sanfeliu, A., Tardioli, D. (eds.) ROBOT 2019. AISC, vol. 1093, pp. 575–586. Springer, Cham (2020). DOI: https://doi.org/10.1007/978-3-030-36150-1_47. ISBN: 978-3-662-45690-3 978-3-662-45691-0Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (16)Cross Ref
  10. 10.Jakob, D., Wilhelm, S., Gerl, A., Ahrens, D.: A quantitative study on awareness, usage and reservations of voice control interfaces by elderly people. In: Stephanidis, C., et al. (eds.) HCII 2021. LNCS, vol. 13096, pp. 237–257. Springer, Cham (2021). DOI: https://doi.org/10.1007/978-3-030-90328-2_15. ISBN: 978-3-030-90328-2Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (18)Digital Library
  11. 11.Kobayashi, M., et al.: Effects of age-related cognitive decline on elderly user interactions with voice-based dialogue systems. In: Lamas, D., Loizides, F., Nacke, L., Petrie, H., Winckler, M., Zaphiris, P. (eds.) INTERACT 2019. LNCS, vol. 11749, pp. 53–74. Springer, Cham (2019). DOI: https://doi.org/10.1007/978-3-030-29390-1_4. ISBN: 978-3-030-29389-5 978-3-030-29390-1Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (20)Digital Library
  12. 12.Kowalski, J., et al.: Older adults and voice interaction: a pilot study with Google home. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems. CHI EA 2019, pp. 1–6. Association for Computing Machinery, New York, NY, USA, May 2019. DOI: https://doi.org/10.1145/3290607.3312973. ISBN: 978-1-4503-5971-9Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (22)Digital Library
  13. 13.Lewis, J.R., Hardzinski, M.L.: Investigating the psychometric properties of the speech user interface service quality questionnaire. Int. J. Speech Technol. 18(3), 479–487 (2015). DOI: https://doi.org/10.1007/s10772-015-9289-1. ISSN: 1572-8110Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (24)Digital Library
  14. 14.Lewis, J.R., Utesch, B.S., Maher, D.E.: Investigating the correspondence between UMUX-LITE and SUS scores. In: Marcus, A. (ed.) DUXU 2015. LNCS, vol. 9186, pp. 204–211. Springer, Cham (2015). DOI: https://doi.org/10.1007/978-3-319-20886-2_20. ISBN: 978-3-319-20886-2Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (26)Cross Ref
  15. 15.Lewis, J.R., Utesch, B.S., Maher, D.E.: UMUX-LITE: when there’s no time for the SUS. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 2099–2102. ACM, Paris, France, April 2013. DOI: https://doi.org/10.1145/2470654.2481287. https://dl.acm.org/doi/10.1145/2470654.2481287. ISBN: 978-1-4503-1899-0Google ScholarDigital Library
  16. 16.Loitsch, C.: Designing accessible user interfaces for all by means of adaptive systems. Ph.D. thesis. Dresden University of Technology, Germany (2018). https://nbn-resolving.org/urn:nbn:de:bsz:14-qucosa2-319846Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (30)
  17. 17.Müller, S.: Realisierung nutzeradaptiven Interaktionsverhaltens für mobile Assistenzroboter. Ph.D. thesis. Ilmenau, October 2016. https://www.db-thueringen.de/receive/dbt_mods_00030393Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (31)
  18. 18.Norberto Pires, J.: Robot-by-voice: experiments on commanding an industrial robot using the human voice. Ind. Rob. Int. J. 32(6), 505–511 (2005). DOI: https://doi.org/10.1108/01439910510629244. ISSN: 0143-991XGoogle ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (32)Cross Ref
  19. 19.OpenAI. GPT-4 Technical Report. arXiv:2303.08774 [cs], March 2023. DOI: https://doi.org/10.48550/arXiv.2303.08774Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (34)Cross Ref
  20. 20.Papachristos, E., Meldgaard, D.P., Thomsen, I.R., Skov, M.B.: ReflectPal: exploring self-reflection on collaborative activities using voice assistants. In: Ardito, C., et al. (eds.) INTERACT 2021. LNCS, vol. 12935, pp. 187–208. Springer, Cham (2021). DOI: https://doi.org/10.1007/978-3-030-85610-6_12. https://link.springer.com/10.1007/978-3-030-85610-6_12. ISBN: 978-3-030-85609-0 978-3-030-85610-6Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (36)Digital Library
  21. 21.Peng, Z., et al.: Shrinking bigfoot: reducing wav2vec 2.0 footprint. In: SUSTAINLP (2021). DOI: https://doi.org/10.18653/v1/2021.sustainlp-1.14Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (38)Cross Ref
  22. 22.Poirier, S., Routhier, F., Campeau-Lecours, A.: Voice control interface prototype for assistive robots for people living with upper limb disabilities. In: 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR), Toronto, ON, Canada, pp. 46–52. IEEE, June 2019. DOI: https://doi.org/10.1109/ICORR.2019.8779524. https://ieeexplore.ieee.org/document/8779524/. ISBN: 978-1-72812-755-2Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (40)Digital Library
  23. 23.Polkosky, M.: Toward a social-cognitive psychology of speech technology: affective responses to speech-based e-service, February 2005Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (42)
  24. 24.Prodanov, P.J., et al.: Voice enabled interface for interactive tour-guide robots. In: IEEE/RSJ International Conference on Intelligent Robots and System, Lausanne, Switzerland, vol. 2, pp. 1332–1337. IEEE (2002). DOI: https://doi.org/10.1109/IRDS.2002.1043939. http://ieeexplore.ieee.org/document/1043939/. ISBN: 978-0-7803-7398-3Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (43)Cross Ref
  25. 25.Reimers, N., Gurevych, I.: Making monolingual sentence embeddings multilingual using knowledge distillation. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, November 2020. https://arxiv.org/abs/2004.09813Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (45)
  26. 26.Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using Siamese BERT-networks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, November 2019. https://arxiv.org/abs/1908.10084Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (46)
  27. 27.Salai, A.-M., Cook, G., Holmquist, L.E.: IntraVox: a personalized human voice to support users with complex needs in smart homes. In: Ardito, C., et al. (eds.) INTERACT 2021. LNCS, vol. 12932, pp. 223–244. Springer, Cham (2021). DOI: https://doi.org/10.1007/978-3-030-85623-6_15. ISBN: 978-3-030-85623-6Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (47)Digital Library
  28. 28.Sin, J., et al.: Does Alexa live up to the hype? Contrasting expectations from mass media narratives and older adults’ hands-on experiences of voice interfaces. In: 4th Conference on Conversational User Interfaces, Glasgow, United Kingdom, pp. 1–9. ACM, July 2022. DOI: https://doi.org/10.1145/3543829.3543841. https://dl.acm.org/doi/10.1145/3543829.3543841. ISBN: 978-1-4503-9739-1Google ScholarDigital Library
  29. 29.Stiefelhagen, R., et al.: Enabling multimodal human-robot interaction for the karlsruhe humanoid robot. IEEE Trans. Rob. 23(5), 840–851 (2007). DOI: https://doi.org/10.1109/TRO.2007.907484. http://ieeexplore.ieee.org/document/4339550/. ISSN: 1552-3098, 1941-0468Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (51)Digital Library
  30. 30.Vaswani, A., et al.: Attention is all you need. arXiv:1706.03762 [cs], December 2017. arXiv: 1706.03762Google ScholarEnhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (53)
  31. 31.Wolters, M., et al.: Being old doesn’t mean acting old: how older users interact with spoken dialog systems. ACM Trans. Accessible Comput. 2(1), 1–39 (2009). DOI: https://doi.org/10.1145/1525840.1525842. https://dl.acm.org/doi/10.1145/1525840.1525842. ISSN: 1936-7228, 1936-7236Google ScholarDigital Library

Cited By

View all

Enhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (56)

    Recommendations

    • KinVoices: Using Voices of Friends and Family in Voice Interfaces

      CSCW2

      With voice user interfaces (VUIs) becoming ubiquitous and speech synthesis technology maturing, it is possible to synthesise voices to resemble our friends and relatives (which we will collectively call 'kin') and use them on VUIs. However, designing ...

      Read More

    • A Usability Study of an Assistive Touch Voice Interface based Automated Teller Machine (ATM)

      DEV '15: Proceedings of the 2015 Annual Symposium on Computing for Development

      Automated Teller Machines (ATMs) have become increasingly common in urban areas of the developing world. In Pakistan, low-literate users who previously did not have access to traditional banking services have now been issued ATM cards for direct-cash ...

      Read More

    • Usability of nomadic user interfaces

      HCII'11: Proceedings of the 14th international conference on Human-computer interaction: towards mobile and intelligent interaction environments - Volume Part III

      During the last decade, a number of research activities have been performed to enable user interfaces and the underlying user activities to be migrated from one device to another. We call this "Nomadic User Interfaces". The primary goal of these ...

      Read More

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    Get this Publication

    • Information
    • Contributors
    • Published in

      Enhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (57)

      Artificial Intelligence in HCI: 5th International Conference, AI-HCI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Washington, DC, USA, June 29–July 4, 2024, Proceedings, Part III

      Jun 2024

      497 pages

      ISBN:978-3-031-60614-4

      DOI:10.1007/978-3-031-60615-1

      • Editors:
      • Helmut Degen

        Siemens Corporation, Princeton, NJ, USA

        ,
      • Stavroula Ntoa

        Foundation for Research and Technology - FORTH, Heraklion, Crete, Greece

      © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024

      Sponsors

        In-Cooperation

          Publisher

          Springer-Verlag

          Berlin, Heidelberg

          Publication History

          • Published: 29 June 2024

          Author Tags

          • Human-Robot Interaction
          • Voice Interface
          • User Study

          Qualifiers

          • Article

          Conference

          Funding Sources

          • Enhancing Usability ofVoice Interfaces forSocially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (58)

            Other Metrics

            View Article Metrics

          • Bibliometrics
          • Citations0
          • Article Metrics

            • Total Citations

              View Citations
            • Total Downloads

            • Downloads (Last 12 months)0
            • Downloads (Last 6 weeks)0

            Other Metrics

            View Author Metrics

          • Cited By

            This publication has not been cited yet

          Digital Edition

          View this article in digital edition.

          View Digital Edition

          • Figures
          • Other

            Close Figure Viewer

            Browse AllReturn

            Caption

            View Table of Contents

            Export Citations

              Your Search Results Download Request

              We are preparing your search results for download ...

              We will inform you here when the file is ready.

              Download now!

              Your Search Results Download Request

              Your file of search results citations is now ready.

              Download now!

              Your Search Results Download Request

              Your search export query has expired. Please try again.

              Enhancing Usability of Voice Interfaces for Socially Assistive Robots Through Deep Learning: A German Case Study | Artificial Intelligence in HCI (2024)
              Top Articles
              Latest Posts
              Article information

              Author: Mr. See Jast

              Last Updated:

              Views: 6052

              Rating: 4.4 / 5 (75 voted)

              Reviews: 82% of readers found this page helpful

              Author information

              Name: Mr. See Jast

              Birthday: 1999-07-30

              Address: 8409 Megan Mountain, New Mathew, MT 44997-8193

              Phone: +5023589614038

              Job: Chief Executive

              Hobby: Leather crafting, Flag Football, Candle making, Flying, Poi, Gunsmithing, Swimming

              Introduction: My name is Mr. See Jast, I am a open, jolly, gorgeous, courageous, inexpensive, friendly, homely person who loves writing and wants to share my knowledge and understanding with you.