MelonBread.dev

Qwen2.5-Omni-7B was just released, and it is easily one of my new favorite LLMs: it can take text, audio, images, and video as input, and it even gives you audio as an output along with the text.
https://huggingface.co/Qwen/Qwen2.5-Omni-7B

You can play with it here:
https://huggingface.co/spaces/Qwen/Qwen2.5-Omni-7B-Demo