I fine-tuned Cohere Transcribe to support diarization and timestamps

r/LocalLLaMA
Machine Learning AI Research

Hi I'll keep it short: Cohere-transcribe is currently the best open source speech to text model (and possibly even better than other