Generate speech with optional visual input
Chat-Response-LLAMA
Extract information from PDFs and images
Talk to OpenAI models using the OpenAI multimodal API
Generate images from text prompts
Voice chat with AI that has web access