Inference at the edge · ggerganov llama.cpp · Discussion #205
Inference at the edge Based on the positive responses to whisper.cpp, and more recently, llama.cpp, it looks like there is a strong and growing interest for doing efficient transformer model infere...