Jina AI Reader is an open-source tool designed to convert any URL into an LLM-friendly input format. Developed and maintained by Jina AI, it is licensed under Apache-2.0 and provides high-quality web content extraction services for AI Agents and RAG (Retrieval-Augmented Generation) systems.
Add https://r.jina.ai/ before any URL.
Original URL: https://en.wikipedia.org/wiki/Artificial_intelligence
Converted URL: https://r.jina.ai/https://en.wikipedia.org/wiki/Artificial_intelligence
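The prefix rule above can be sketched in a few lines of Python. This is only an illustration of the string transformation; the helper name `reader_url` is hypothetical and not part of Reader.

```python
READER_PREFIX = "https://r.jina.ai/"


def reader_url(target_url: str) -> str:
    """Return the Reader URL for target_url by prepending the prefix (hypothetical helper)."""
    return READER_PREFIX + target_url


print(reader_url("https://en.wikipedia.org/wiki/Artificial_intelligence"))
```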
Add https://s.jina.ai/ before the URL-encoded query.
Query: Who will win 2024 US presidential election?
Search URL: https://s.jina.ai/Who%20will%20win%202024%20US%20presidential%20election%3F
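The query has to be percent-encoded before it is appended to the prefix. A minimal sketch using Python's standard library follows; the helper name `search_url` is hypothetical.

```python
from urllib.parse import quote

SEARCH_PREFIX = "https://s.jina.ai/"


def search_url(query: str) -> str:
    """Build a search URL from a plain-text query (hypothetical helper)."""
    # safe="" percent-encodes every reserved character, including "?" and "/".
    return SEARCH_PREFIX + quote(query, safe="")


print(search_url("Who will win 2024 US presidential election?"))
```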
Image descriptions appear in the output in the form: Image [idx]: [caption]
Enable them with the request header: x-with-generated-alt: true
To restrict a search to specific sites, set site=example.com in the query parameters:
curl 'https://s.jina.ai/When%20was%20Jina%20AI%20founded%3F?site=jina.ai&site=github.com'
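The same URL can be assembled programmatically. The sketch below assumes that repeating the `site` parameter once per domain, as in the curl command, is how several sites are passed; the helper name `site_search_url` is hypothetical.

```python
from typing import Sequence
from urllib.parse import quote, urlencode


def site_search_url(query: str, sites: Sequence[str]) -> str:
    """Build a search URL limited to the given sites (hypothetical helper)."""
    url = "https://s.jina.ai/" + quote(query, safe="")
    if sites:
        # A list of pairs lets the same key ("site") appear more than once.
        url += "?" + urlencode([("site", site) for site in sites])
    return url


print(site_search_url("When was Jina AI founded?", ["jina.ai", "github.com"]))
```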
# Enable image descriptions
x-with-generated-alt: true
# Forward Cookie settings
x-set-cookie: [cookie_string]
# Bypass cache
x-no-cache: true
# Custom cache tolerance (seconds)
x-cache-tolerance: [seconds]
# Specify proxy server
x-proxy-url: [proxy_url]
# Target element selector
x-target-selector: [css_selector]
# Wait for specific element to appear
x-wait-for-selector: [css_selector]
# Set timeout
x-timeout: [seconds]
# Return Markdown format (bypass readability filtering)
x-respond-with: markdown
# Return raw HTML
x-respond-with: html
# Return plain text
x-respond-with: text
# Return webpage screenshot URL
x-respond-with: screenshot
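When calling Reader from code, it can help to collect these options in one place. The following is a minimal sketch that only builds a header dictionary from a subset of the headers listed above; the function name and its parameters are hypothetical, and nothing is sent over the network.

```python
from typing import Dict, Optional

# Values listed above for the x-respond-with header.
RESPOND_WITH_VALUES = {"markdown", "html", "text", "screenshot"}


def reader_headers(
    generated_alt: bool = False,
    no_cache: bool = False,
    target_selector: Optional[str] = None,
    wait_for_selector: Optional[str] = None,
    timeout_seconds: Optional[int] = None,
    respond_with: Optional[str] = None,
) -> Dict[str, str]:
    """Translate keyword options into Reader request headers (hypothetical helper)."""
    headers: Dict[str, str] = {}
    if generated_alt:
        headers["x-with-generated-alt"] = "true"
    if no_cache:
        headers["x-no-cache"] = "true"
    if target_selector is not None:
        headers["x-target-selector"] = target_selector
    if wait_for_selector is not None:
        headers["x-wait-for-selector"] = wait_for_selector
    if timeout_seconds is not None:
        headers["x-timeout"] = str(timeout_seconds)
    if respond_with is not None:
        if respond_with not in RESPOND_WITH_VALUES:
            raise ValueError(f"unsupported x-respond-with value: {respond_with}")
        headers["x-respond-with"] = respond_with
    return headers


print(reader_headers(no_cache=True, timeout_seconds=30, respond_with="markdown"))
```

The resulting dictionary can be passed to any HTTP client that accepts custom headers.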
# Enable streaming mode
curl -H "Accept: text/event-stream" https://r.jina.ai/[URL]
# Get JSON format response
curl -H "Accept: application/json" https://r.jina.ai/[URL]
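The response mode is selected through the `Accept` header. As a rough Python equivalent of the two curl commands, the sketch below prepares a request object without sending it; the helper name `reader_request` is hypothetical.

```python
from urllib.request import Request


def reader_request(target_url: str, accept: str = "application/json") -> Request:
    """Prepare (but do not send) a Reader request with the given Accept header."""
    return Request("https://r.jina.ai/" + target_url, headers={"Accept": accept})


req = reader_request("https://example.com/")
print(req.full_url, req.get_header("Accept"))
# Sending it would be urllib.request.urlopen(req), which needs network access.
```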
JSON Response Structure:
{
  "url": "Original URL",
  "title": "Page Title",
  "content": "Extracted Content"
}
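A response with this shape can be read into a small typed record. The sketch below assumes the three fields sit at the top level exactly as shown; a live response may carry additional fields or wrapping, and the sample body here is made up for illustration.

```python
import json
from dataclasses import dataclass


@dataclass
class ReaderPage:
    url: str
    title: str
    content: str


def parse_reader_json(body: str) -> ReaderPage:
    """Parse a JSON body with the url/title/content fields shown above."""
    data = json.loads(body)
    return ReaderPage(url=data["url"], title=data["title"], content=data["content"])


# Made-up sample body, not real Reader output.
sample = '{"url": "https://example.com/", "title": "Example Domain", "content": "Example text"}'
page = parse_reader_json(sample)
print(page.title)
```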
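Single-page applications (SPAs) render their content with client-side JavaScript, so the following solutions are provided:
For URLs containing #, use the POST method, because the part of a URL after # is not sent to the server in a normal request: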
curl -X POST 'https://r.jina.ai/' -d 'url=https://example.com/#/route'
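To see why the POST form is used for such URLs, the sketch below splits the example URL with Python's standard library: the route lives entirely in the fragment, separate from the path. It also shows one way to form-encode the body; that the service accepts a percent-encoded body like this is an assumption, since the curl command above sends the value as-is.

```python
from urllib.parse import urlencode, urlsplit

target = "https://example.com/#/route"

parts = urlsplit(target)
print(parts.path)      # the path is just "/"
print(parts.fragment)  # the route "/route" is carried only by the fragment

# Form-encode the full URL, fragment included, for use as a POST body.
body = urlencode({"url": target})
print(body)
```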
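For webpages that display pre-loaded content before the main content is ready, set a timeout, wait for a specific element, or use streaming mode:
curl 'https://r.jina.ai/https://example.com/' -H 'x-timeout: 30'
curl 'https://r.jina.ai/https://example.com/' -H 'x-wait-for-selector: #content'
curl -H "Accept: text/event-stream" https://r.jina.ai/https://example.com/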
Jina AI Reader is a powerful open-source tool specifically designed for modern AI systems, addressing the format and quality issues faced by LLMs when processing web content. By simply adding a URL prefix, you can obtain high-quality, structured web content, making it an ideal tool for building AI Agents and RAG systems.