VOOZH about

URL: https://huggingface.co/papers/2304.13134

⇱ Paper page - LAST: Scalable Lattice-Based Speech Modelling in JAX


Papers
arxiv:2304.13134

LAST: Scalable Lattice-Based Speech Modelling in JAX

Published on Apr 25, 2023
Authors:
,
,

Abstract

LAST is a JAX library implementing differentiable WFSA algorithms for scalable speech transduction, addressing modern architecture and automatic differentiation challenges.

We introduce LAST, a LAttice-based Speech Transducer library in JAX. With an emphasis on flexibility, ease-of-use, and scalability, LAST implements differentiable weighted finite state automaton (WFSA) algorithms needed for training \& inference that scale to a large WFSA such as a recognition lattice over the entire utterance. Despite these WFSA algorithms being well-known in the literature, new challenges arise from performance characteristics of modern architectures, and from nuances in automatic differentiation. We describe a suite of generally applicable techniques employed in LAST to address these challenges, and demonstrate their effectiveness with benchmarks on TPUv3 and V100 GPU.

Community

· Sign up or log in to comment

Get this paper in your agent:

hf papers read 2304.13134

Models citing this paper 2

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2304.13134 in a dataset README.md to link it from this page.

Spaces citing this paper 19

Browse 19 spaces citing this paper

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.