Fascinating. This paper (HuggingGPT) proposes using the LLM as a traffic cop that delegates specific tasks (e.g., image recognition) to existing specialized AI models. Seems to massively increase the power of LLMs.
This pic is one of the simpler examples.
https://arxiv.org/abs/2303.17580
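A toy sketch of the "traffic cop" idea, for illustration only: a planner turns a request into task names, and each task is handed to a specialized model. Everything here is an assumption made up for the example, not the paper's code. `plan_tasks` fakes the LLM with keyword matching, and `caption_image` / `detect_objects` are hypothetical stand-ins for real models.

```python
from typing import Callable, Dict, List, Tuple


# Hypothetical stand-ins for specialized models; a real system would call actual models.
def caption_image(path: str) -> str:
    return f"caption for {path}"


def detect_objects(path: str) -> str:
    return f"objects in {path}"


# Registry mapping a task name to the specialist that handles it (names are illustrative).
SPECIALISTS: Dict[str, Callable[[str], str]] = {
    "image-captioning": caption_image,
    "object-detection": detect_objects,
}


def plan_tasks(request: str) -> List[str]:
    """Stand-in for the LLM planner: keyword matching instead of a real LLM call."""
    text = request.lower()
    tasks: List[str] = []
    if "describe" in text:
        tasks.append("image-captioning")
    if "count" in text or "find" in text:
        tasks.append("object-detection")
    return tasks


def handle(request: str, image_path: str) -> List[Tuple[str, str]]:
    """Route each planned task to its specialist and collect (task, result) pairs."""
    return [(task, SPECIALISTS[task](image_path)) for task in plan_tasks(request)]


if __name__ == "__main__":
    for task, result in handle("Describe this picture and count the animals", "zoo.jpg"):
        print(task, "->", result)
```

In a real setup the planner would presumably be an actual LLM call and the registry would point at hosted models; the dictionary dispatch is just the simplest way to show the delegation step.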
🐦🔗: https://n.respublicae.eu/lugaricano/status/1642738354713243649