Recognize text and elements in images
nanonets2 / dots.ocr / olmOCR2 / chandraOCR
Generate a 3D mesh model from an image
Transcribe audio files into text