Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Student works
  4. Bring voice user interfaces to our offices
 
semester or other student projects

Bring voice user interfaces to our offices

Dümbgen, Frederike  
2018

<strong>Synopsis</strong>: Implement a new way of interacting with your computer via voice control instead of the mouse and keyboard. <br><br> <strong>Level</strong>:BS, MS <br><br> <strong>Description</strong>: Google Home and Amazon Alexa are quickly revolutionizing how we interact with smart devices. Both use “wake words” (“OK Google” and “Alexa” respectively) to detect the user’s intention to interact. While the wake word detection is typically done on the device to insure minimum latency, the user’s commands following it are usually processed remotely. <br> <br> The goal of this project is to program a microcontroller to process acoustic data locally and in real time. The microcontroller should run a speech recognition model to extract specific commands from the spoken words of the user. The chip should then emulate a USB device such as a mouse or keyboard buttons and send the derived commands to trigger actions on the host computer. An important aspect of the project will be to understand the limits of what can be processed on the microcontroller, in terms of memory and computation time.<br> <br> The student has the option to either work on implementing machine learning models such as CNNs on the microchip, or to work on emulating the USB peripheral. Ideally, we will have two students working on both components of the project such that we have a full working system in the end of the semester. <br><br> <strong>Deliverables</strong>: A report and a working system with clear documentation. <br><br> <strong>References</strong>: for useful links, see list of URLs below. <br><br> <strong>Prerequisites</strong>: First part: Knowledge of or strong interest for machine learning, in particular neural networks. Basics in programming of embedded systems. <br>Second part: Basics of C programming, embedded systems, preferably knowledge of USB devices. <br><br> <strong>Type of Work</strong>: 50% algorithm design/analysis, 50% programming

  • Details
  • Metrics
Type
semester or other student projects
Author(s)
Dümbgen, Frederike  
Advisors
Hoffet, Adrien  
•
Dümbgen, Frederike
Date Issued

2018

Subjects

LCAVPP

•

BS_SEMESTER

•

MS_SEMESTER

URL

URL

https://www.tensorflow.org/versions/master/tutorials/audio_recognition

URL

https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/data

URL

https://developer.arm.com/technologies/neon

URL

https://developer.arm.com/technologies/compute-library
Written at

EPFL

EPFL units
LCAV  
Available on Infoscience
December 8, 2017
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/142686
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés