Initiates an asynchronous data extraction process driven by a JSON schema for the desired output and a prompt. Returns an ID for tracking the task.
An array of documents to extract data from.
A JSON schema defining the structure of the desired extraction output.
A prompt guiding the extraction process.
Optional control over the reasoning effort for extraction.
Allowed values: low, medium, high
The model to use for extraction.
Allowed values: reasoning-3-mini, reasoning-3, base-5, base-5-mini, base-5-nano, base-4.1, base-4.1-mini, base-4.1-nano, base-5.2, base-5.2-chat-latest, base-5.4, base-5.4-mini, qwen3-next-80b-thinking, qwen3-next-80b-instruct, glm-4.7, deepseek-v3.2, kimi-k2-thinking, gemini-3-pro, gemini-3-flash, gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-lite
Optional reranker model ID override for relevance filtering.
Allowed values: cohere.rerank-v3-5:0, amazon.rerank-v1:0
Force using the reranker for relevance filtering even with smaller document sets.
Deprecated. Accepted for backwards compatibility and ignored.
Deprecated. Accepted for backwards compatibility and ignored.
The priority of the extraction task. Non-interactive tasks run at lower priority. Background tasks can run in the background while the user is doing other things.
Allowed values: interactive, non-interactive, background
OCR enhancement configuration for figure and text analysis.
OCR quality setting.
Allowed values: high, low
Filter out failed PDFs.
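As a rough illustration of how the parameters above might fit together, the following sketch assembles a request body and checks the enumerated values before anything is sent. The field names are not given in this reference, so every key used here (`documents`, `schema`, `prompt`, `reasoning_effort`, `priority`, `reranker`) and the helper `build_extraction_request` are hypothetical; only the allowed values are taken from the descriptions above.

```python
# Allowed values copied from the parameter descriptions in this reference.
REASONING_EFFORTS = {"low", "medium", "high"}
PRIORITIES = {"interactive", "non-interactive", "background"}
RERANKERS = {"cohere.rerank-v3-5:0", "amazon.rerank-v1:0"}


def build_extraction_request(documents, schema, prompt, *,
                             reasoning_effort=None, priority=None,
                             reranker=None):
    """Assemble a request body. Every key name here is an assumption."""
    body = {
        "documents": list(documents),  # documents to extract data from
        "schema": schema,              # JSON schema of the desired output
        "prompt": prompt,              # prompt guiding the extraction
    }
    optional = (
        ("reasoning_effort", reasoning_effort, REASONING_EFFORTS),
        ("priority", priority, PRIORITIES),
        ("reranker", reranker, RERANKERS),
    )
    for key, value, allowed in optional:
        if value is None:
            continue  # leave unset optional fields out of the body
        if value not in allowed:
            raise ValueError(f"{key} must be one of {sorted(allowed)}")
        body[key] = value
    return body
```

Validating on the client side is a design choice rather than a requirement of the API: it surfaces a mistyped enum value before the asynchronous task is queued.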
Accepted. Advanced extraction process initiated.
The ID to track the asynchronous extraction task.
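A minimal sketch of reading the tracking ID out of the accepted response, under stated assumptions: this reference does not give the status code or the name of the ID field, so the HTTP 202 check, the default key `"id"`, and the helper name `read_task_id` are all hypothetical.

```python
import json


def read_task_id(status_code, body_text, id_key="id"):
    """Return the tracking ID from an accepted response.

    Assumes the endpoint answers with HTTP 202 and a JSON object;
    the key holding the ID is a guess and can be overridden.
    """
    if status_code != 202:
        raise RuntimeError(f"extraction was not accepted (status {status_code})")
    payload = json.loads(body_text)
    if not isinstance(payload, dict):
        raise ValueError("expected a JSON object in the response body")
    task_id = payload.get(id_key)
    if not isinstance(task_id, str) or not task_id:
        raise ValueError(f"response has no usable {id_key!r} field")
    return task_id
```

The returned ID is what a caller would keep in order to track the asynchronous task later.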