NeMo Forced Aligner

Demo for NeMo Forced Aligner (NFA). Upload audio in Tamazight and (optionally) the text spoken in the audio to generate a video where each part of the text will be highlighted as it is spoken.

You can also download CTM and ASS files to add subtitles to your videos.

Input

Microphone input

File upload

[Optional] The reference text. Use '|' separators to specify which text will appear together. Leave this field blank to use an ASR model's transcription as the reference text instead.

Separate text on new lines

Output

Output Video

You can use this space to convert CTM files to SRT format.

Tutorial: "How to use NFA?" 🚀 | Blog post: "How does forced alignment work?" 📚 | NFA Github page 👩‍💻

Examples

File upload	[Optional] The reference text. Use '\|' separators to specify which text will appear together. Leave this field blank to use an ASR model's transcription as the reference text instead.