Image from Willett, FR, et. al. A high-performance speech neuroprosthesis. Nature (2023). Diagram of the decoding algorithm. First, neural activity (multiunit threshold crossings and spike band power) is temporally binned and smoothed on each electrode. Second, a recurrent neural network (RNN) converts a time series of this neural activity into a time series of probabilities of each phoneme (plus the probability of an interword ‘silence’ token and a ‘blank’ token associated with the connectionist temporal classification training procedure). The RNN is a five-layer, gated recurrent unit architecture trained using TensorFlow 2. Finally, phoneme probabilities are combined with a large-vocabulary language model (a custom, 125,000-word trigram model implemented in Kaldi) to decode the most probable sentence. Phonemes in this diagram are denoted using the International Phonetic Alphabet.
Neuroscientists have developed a bionic device powered by AI that helps aphasic patients regain their speech, turning futuristic concepts into reality.1 This innovative approach is revolutionizing treatment and offering hope to many.
Thanks for reading The Longevity Ledger! Subscribe for free to receive new posts and support my work.
Willett, F.R., Kunz, E.M., Fan, C. et al. A high-performance speech neuroprosthesis. Nature (2023). https://doi.org/10.1038/s41586-023-06377-x
Discussion about this episode
Share this post
The Longevity Ledger Podcast
A healthcare newsletter showcasing topics on longevity that aims to help us understand how to increase our lifespan and improve our healthspan. It is designed to help its readers start taking care of their health now.
A healthcare newsletter showcasing topics on longevity that aims to help us understand how to increase our lifespan and improve our healthspan. It is designed to help its readers start taking care of their health now.
Share this post