Mapping the multimodal AI landscape with a structured taxonomy
AI platforms make it hard to filter models by multimodal capabilities. I built an open-source taxonomy that maps which inputs produce which outputs.
Tag
2 posts
AI platforms make it hard to filter models by multimodal capabilities. I built an open-source taxonomy that maps which inputs produce which outputs.
A curated resource list of multimodal AI models with native audio support — models that process audio tokens, not just transcribe.