Traditional speech recognition methods rely on software-based feature extraction that introduces latency and high energy costs, making them unsuitable for low-power devices. A proof-of-concept demonstration is provided of a bioinspired tonotopic sensor for speech recognition that mimics the human cochlea, using a spiral-shaped elastic metamaterial. The measured modal response of the structure at different frequencies generates a spatially distributed signal, providing a spatiotemporal map of the input named “tonogram”. The device acts as an in-sensor physical reservoir computing system, working simultaneously as a sensor and as a computing unit, capable of extracting features of spoken words relevant to speech recognition. Results indicate that this can serve as a valid alternative to traditional software-based digital preprocessing, ensuring high accuracy in terms of classification, while reducing computational requirements. This work demonstrates the potential of bioinspired metamaterials for energy-efficient auditory sensing and, beyond speech recognition, for applications such as IoT devices and edge computing artificial intelligence systems.

Speech Recognition with Cochlea‐Inspired In‐Sensor Computing / Beoletto, Paolo H.; Milano, Gianluca; Ricciardi, Carlo; Bosia, Federico; Gliozzi, Antonio S.. - In: ADVANCED INTELLIGENT SYSTEMS. - ISSN 2640-4567. - 8:1(2026). [10.1002/aisy.202500526]

Speech Recognition with Cochlea‐Inspired In‐Sensor Computing

Milano, Gianluca;
2026

Abstract

Traditional speech recognition methods rely on software-based feature extraction that introduces latency and high energy costs, making them unsuitable for low-power devices. A proof-of-concept demonstration is provided of a bioinspired tonotopic sensor for speech recognition that mimics the human cochlea, using a spiral-shaped elastic metamaterial. The measured modal response of the structure at different frequencies generates a spatially distributed signal, providing a spatiotemporal map of the input named “tonogram”. The device acts as an in-sensor physical reservoir computing system, working simultaneously as a sensor and as a computing unit, capable of extracting features of spoken words relevant to speech recognition. Results indicate that this can serve as a valid alternative to traditional software-based digital preprocessing, ensuring high accuracy in terms of classification, while reducing computational requirements. This work demonstrates the potential of bioinspired metamaterials for energy-efficient auditory sensing and, beyond speech recognition, for applications such as IoT devices and edge computing artificial intelligence systems.
File in questo prodotto:
File Dimensione Formato  
Advanced Intelligent Systems - 2025 - Beoletto - Speech Recognition with Cochleaâ Inspired Inâ Sensor Computing.pdf

accesso aperto

Tipologia: final published article (publisher’s version)
Licenza: Creative Commons
Dimensione 2.15 MB
Formato Adobe PDF
2.15 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11696/88379
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact