VLM Run Hub: Pre-defined Pydantic Schemas for Simplified Visual Data Extraction
2025-02-20
VLM Run Hub is a comprehensive repository of pre-defined Pydantic schemas for extracting structured data from unstructured visual domains like images, videos, and documents. Designed for Vision Language Models (VLMs) and optimized for real-world use cases, it simplifies integrating visual ETL into your workflows. It offers various pre-defined schemas, such as an Invoice schema for extracting invoice metadata, and supports multiple VLMs including OpenAI's GPT-4o and Anthropic's Claude Vision. Using Pydantic schemas ensures accurate and reliable data extraction and simplifies downstream workflows.
Development
Vision Language Models