Jina AI Reader

Overview

This connector allows access to the Jina Reader API using two modes:

  • "Reader" Mode (https://r.jina.ai) - Given one or more URLs as input, returns the content of those pages as Markdown text. The Reader endpoint extracts the core content from a URL and converts it into clean, LLM-friendly text, ensuring high-quality input for your agent and RAG systems.
  • "Search" Mode (https://s.jina.ai) - Similar to the Reader endpoint, but accepts a search prompt and returns the text from the top 5 search results.

Both API endpoints generate human-readable Markdown, which can also be processed efficiently by downstream LLM and GenAI applications. Both modes can be used in the same sync by following the configuration instructions below.

Available Streams

The output depends on the input content, but the JSON format of the response is the same in every case.

To get results, append the URL or search prompt to the base URL https://r.jina.ai/ or https://s.jina.ai/.
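
As a minimal sketch of how the request URLs are formed, the helper below appends a page URL or a search prompt to the matching base URL. The function name `build_jina_url` and the decision to percent-encode the search prompt are assumptions made for illustration; they are not taken from the connector itself.

```python
from urllib.parse import quote

# Base URLs for the two modes described in this document.
READER_BASE = "https://r.jina.ai/"
SEARCH_BASE = "https://s.jina.ai/"


def build_jina_url(mode: str, target: str) -> str:
    """Return the request URL for a page URL ("reader") or a prompt ("search").

    Hypothetical helper: the name and the percent-encoding are assumptions.
    """
    if mode == "reader":
        # Reader mode: the page URL follows the base URL as-is.
        return READER_BASE + target
    if mode == "search":
        # Search mode: encode spaces and reserved characters in the prompt.
        return SEARCH_BASE + quote(target, safe="")
    raise ValueError(f"unknown mode: {mode!r}")


print(build_jina_url("reader", "https://example.com"))
print(build_jina_url("search", "what is RAG"))
```

Keeping URL construction in one small function makes it easy to use both modes in the same sync without duplicating the base URLs.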

If there are more endpoints you'd like to see supported, please create an issue.

Features

Feature | Supported?
Full Refresh Sync | Yes
Incremental Sync | No
SSL connection | Yes
Namespaces | No

Getting started

Requirements

  • Jina AI Bearer Token (For higher rate limits)
  • Reader URL
  • Search prompt

Setup guide

Go to https://jina.ai/reader/#apiform for the complete guide to pricing and tokens. The website also provides a free bearer token for testing through its interface.
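
A bearer token is conventionally sent in the `Authorization` header. The sketch below assumes that convention, treats the token as optional (as the "for higher rate limits" note suggests), and assumes JSON output is requested with an `Accept` header. `build_headers` is a hypothetical helper and `YOUR_TOKEN` is a placeholder.

```python
from typing import Dict, Optional


def build_headers(api_key: Optional[str] = None) -> Dict[str, str]:
    """Build request headers; the token is optional (hypothetical helper)."""
    # Assumption: JSON output is requested through the Accept header.
    headers = {"Accept": "application/json"}
    if api_key:
        # Assumption: standard bearer scheme for the Jina AI token.
        headers["Authorization"] = f"Bearer {api_key}"
    return headers


print(build_headers("YOUR_TOKEN"))
```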

Reference

Config fields reference

Field | Type | Property name
- | string | api_key
- | string | read_prompt
- | string | search_prompt
Set this to true to create a "Buttons & Links" section at the end. This helps downstream LLMs or web agents navigate the page or take further actions. | boolean | gather_links
Set this to true to create an "Images" section at the end. This gives downstream LLMs an overview of all visuals on the page, which may improve reasoning. | boolean | gather_images
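
To make the field types concrete, here is a small sketch of a configuration expressed as a Python dict, together with a type check against the property names listed above. The helper `check_config` and all values are hypothetical placeholders. Passing a page URL as `read_prompt` is an assumption based on the "Reader URL" requirement.

```python
from typing import Any, Dict, List

# Expected type for each property listed in the reference table.
CONFIG_TYPES = {
    "api_key": str,
    "read_prompt": str,
    "search_prompt": str,
    "gather_links": bool,
    "gather_images": bool,
}


def check_config(config: Dict[str, Any]) -> List[str]:
    """Return a list of problems found in a config dict (hypothetical helper)."""
    problems = []
    for name, value in config.items():
        expected = CONFIG_TYPES.get(name)
        if expected is None:
            problems.append(f"unknown property: {name}")
        elif not isinstance(value, expected):
            problems.append(f"{name} should be {expected.__name__}")
    return problems


# Placeholder values for illustration only.
example_config = {
    "api_key": "YOUR_TOKEN",
    "read_prompt": "https://example.com",  # assumption: the page URL to read
    "search_prompt": "what is RAG",
    "gather_links": True,
    "gather_images": False,
}
print(check_config(example_config))
```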

Changelog

Version | Date | Pull Request | Subject
0.1.2 | 2024-07-06 | 40880 | Update dependencies
0.1.1 | 2024-06-25 | 40359 | Update dependencies
0.1.0 | 2024-06-25 | 39515 | Add Jina AI source