Drop any banking document into S3 and call POST /pipeline/ingest.
Orkes orchestrates Textract OCR, Bedrock embeddings, and parallel storage — all visible as a live DAG in the Orkes UI.
1,000 real banking PDFs are ready to ingest on demand.
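As a minimal sketch of the trigger call, the snippet below builds a `POST /pipeline/ingest` request using only the Python standard library. The base URL and the `bucket` / `key` payload fields are assumptions made for illustration; this document does not specify the request body, so adjust them to match the actual API.

```python
import json
import urllib.request


def build_ingest_request(base_url, bucket, key):
    """Build (but do not send) a POST /pipeline/ingest request."""
    # Hypothetical payload: the real field names are not given in this document.
    payload = {"bucket": bucket, "key": key}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/pipeline/ingest",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Placeholder host, bucket, and object key.
request = build_ingest_request(
    "https://api.example.com", "my-bucket", "statements/sample.pdf"
)
print(request.get_method(), request.full_url)
```

Passing the resulting object to `urllib.request.urlopen` would send it; the sketch stops short of that so it can be run without a deployed pipeline.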
start_document_text_detection. Polls until job completes. Returns raw text + page count. Handles scanned / image PDFs via real OCR.
/pipeline/step/* endpoints directly. No polling daemon, no EC2, no open terminal.
Just trigger the workflow and Orkes does the rest.
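The Textract step described above (start a job, poll until it completes, return raw text and page count) could look roughly like the sketch below. It is not the pipeline's actual implementation. The function name `extract_text`, the polling interval, and the retry limit are assumptions; the `textract` argument is expected to be a boto3 Textract client, passed in so the function can also be exercised with a stub.

```python
import time


def extract_text(textract, bucket, key, poll_seconds=5.0, max_polls=120):
    """Run asynchronous Textract text detection; return (text, page_count)."""
    job = textract.start_document_text_detection(
        DocumentLocation={"S3Object": {"Bucket": bucket, "Name": key}}
    )
    job_id = job["JobId"]

    # Poll until the job leaves IN_PROGRESS or the retry budget runs out.
    for _ in range(max_polls):
        result = textract.get_document_text_detection(JobId=job_id)
        if result["JobStatus"] != "IN_PROGRESS":
            break
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"Textract job {job_id} did not finish in time")

    if result["JobStatus"] != "SUCCEEDED":
        raise RuntimeError(f"Textract job {job_id} ended as {result['JobStatus']}")

    page_count = result["DocumentMetadata"]["Pages"]
    lines = []
    while True:
        # Keep only LINE blocks; PAGE and WORD blocks would duplicate the text.
        lines.extend(
            block["Text"]
            for block in result.get("Blocks", [])
            if block["BlockType"] == "LINE"
        )
        token = result.get("NextToken")
        if not token:
            break
        # Large documents are returned in pages of results; follow the token.
        result = textract.get_document_text_detection(JobId=job_id, NextToken=token)

    return "\n".join(lines), page_count
```

With boto3 installed and AWS credentials configured, a call such as `extract_text(boto3.client("textract"), "my-bucket", "statements/sample.pdf")` would run it against a real document; the bucket and key here are placeholders.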
orkes_ui_url pointing directly to the running execution. Open it to see: