Llama 3.1 Omni Model
LLaMA-Omni is a low-latency, high-quality end-to-end speech interaction model built on Llama-3.1-8B-Instruct, aiming for speech capabilities at the GPT-4o level. Repository: ictnlp/LLaMA-Omni
If you have a good solution, feel free to submit a PR. To run inference locally, organize the speech instruction files according to the format in the omni_speech/infer/examples directory, then refer to the inference script in the repository. If you have any questions, feel free to submit an issue or contact fangqingkai21b@ict.ac.cn.
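Before arranging files for local inference, it can help to confirm that each speech instruction file is readable audio. The snippet below is only a pre-flight sketch and is not the repository's inference script, which is not reproduced here. It assumes the instructions are PCM WAV files collected in one directory (an assumption; the authoritative layout is whatever omni_speech/infer/examples contains), and the helper name `summarize_wavs` is hypothetical.

```python
import wave
from pathlib import Path


def summarize_wavs(directory):
    """Return one summary dict per readable .wav file in `directory`.

    Hypothetical helper, not part of LLaMA-Omni. Assumes the speech
    instructions are PCM WAV files stored directly in `directory`.
    """
    summaries = []
    for path in sorted(Path(directory).glob("*.wav")):
        try:
            with wave.open(str(path), "rb") as wav:
                rate = wav.getframerate()
                frames = wav.getnframes()
                channels = wav.getnchannels()
        except (wave.Error, EOFError):
            # Not a valid PCM WAV file (or truncated): leave it out.
            continue
        summaries.append({
            "file": path.name,
            "sample_rate": rate,
            "channels": channels,
            "seconds": frames / rate if rate else 0.0,
        })
    return summaries
```

Calling `summarize_wavs("my_instructions")` on a directory of your own recordings (the directory name is a placeholder) returns a list you can inspect before copying the files into the layout shown in the examples directory.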