An incredibly fast implementation of Whisper optimized for Apple Silicon.
10x faster than Whisper CPP, 4x faster than current MLX Whisper implementation.
Install lightning whisper mlx using pip:
pip install lightning-whisper-mlx
Models:
["tiny", "small", "distil-small.en", "base", "medium",
distil-medium.en", "large", "large-v2", "distil-large-v2", "large-v3",
"distil-large-v3"]
Quantization:
[None, "4bit", "8bit"]
from lightning_whisper_mlx import LightningWhisperMLX
whisper = LightningWhisperMLX(model="base", batch_size=12, quant=None)
text = whisper.transcribe(audio_path="/audio.mp3")['text']
print(text)