This workflow turns n8n into an API endpoint that extracts structured data from images using Google Gemini AI. You send an image URL and define the required fields; the workflow then processes the image, extracts the relevant details, and returns clean JSON output.
It's perfect for OCR-based automation tasks such as processing identity documents, invoices, receipts, or business cards.
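As a rough illustration of how a client might call such an endpoint, here is a minimal Python sketch. The webhook URL and the payload keys `image_url` and `properties` are assumptions made for this example, not names confirmed by the workflow; check the Webhook node's configuration for the real path and field names.

```python
import json
import urllib.request

# Hypothetical endpoint; replace with the URL shown on your Webhook node.
WEBHOOK_URL = "https://n8n.example.com/webhook/extract-image-data"


def build_extraction_request(image_url, properties, webhook_url=WEBHOOK_URL):
    """Build a POST request carrying the image URL and the fields to extract.

    The keys "image_url" and "properties" are assumed names for illustration.
    """
    payload = {"image_url": image_url, "properties": properties}
    return urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def extract(image_url, properties, webhook_url=WEBHOOK_URL, timeout=60):
    """Send the request and parse the JSON body returned by the workflow."""
    request = build_extraction_request(image_url, properties, webhook_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call such as `extract("https://example.com/id-card.jpg", ["Name", "Date of Birth"])` would then return whatever JSON object the workflow produces for those fields.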
Features
- 📥 Webhook API Endpoint: Accepts image URLs and extraction requirements
- 🖼 Image Handling: Fetches the image and converts it into base64 format for processing
- 🤖 AI-Powered Extraction: Leverages Gemini Flash Lite for advanced image-to-text analysis
- 🎯 Customizable Output: Define your own properties (e.g., Name, Date of Birth, PAN Number, Validity)
- 📊 Structured JSON Response: Returns clean, schema-driven JSON results
- ⚡ Ready-to-Use API: Integrates easily with applications for automated data entry and workflows
- 🔧 Flexible Use Cases: Supports document OCR, form digitization, text extraction, and more
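The image handling and schema-driven extraction steps listed above can be sketched roughly as follows. This is an assumption about what the workflow does internally, not a copy of its nodes: the helper names are hypothetical, the prompt text is invented, and treating every requested property as a string field is a simplification. The request body follows the general shape of the Gemini `generateContent` REST API (inline base64 image data plus a JSON response schema).

```python
import base64


def image_to_base64(image_bytes):
    """Encode raw image bytes as a base64 ASCII string."""
    return base64.b64encode(image_bytes).decode("ascii")


def build_gemini_body(image_b64, properties, mime_type="image/jpeg"):
    """Assemble a generateContent-style body asking for the given fields.

    Assumption: each requested property is modeled as a plain string.
    """
    schema = {
        "type": "OBJECT",
        "properties": {name: {"type": "STRING"} for name in properties},
    }
    # Illustrative prompt wording; the workflow's actual prompt may differ.
    prompt = "Extract the following fields from the image: " + ", ".join(properties)
    return {
        "contents": [
            {
                "parts": [
                    {"text": prompt},
                    {"inline_data": {"mime_type": mime_type, "data": image_b64}},
                ]
            }
        ],
        "generationConfig": {
            "responseMimeType": "application/json",
            "responseSchema": schema,
        },
    }
```

Requesting JSON output constrained by a schema is what keeps the response "clean": the model is asked to return only the named properties rather than free-form text.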
