Sim

Jina

Convert website content into text

Jina AI is a powerful content extraction tool that seamlessly integrates with Sim to transform web content into clean, readable text. This integration allows developers to easily incorporate web content processing capabilities into their agentic workflows.

Jina AI Reader specializes in extracting the most relevant content from web pages, removing clutter, advertisements, and formatting issues to produce clean, structured text that's optimized for language models and other text processing tasks.

With the Jina AI integration in Sim, you can:

  • Extract clean content from any web page by simply providing a URL
  • Process complex web layouts into structured, readable text
  • Maintain important context while removing unnecessary elements
  • Prepare web content for further processing in your agent workflows
  • Streamline research tasks by quickly converting web information into usable data

This integration is particularly valuable for building agents that need to gather and process information from the web, conduct research, or analyze online content as part of their workflow.

Usage Instructions

Transform web content into clean, readable text using Jina AI's advanced extraction capabilities. Extract meaningful content from websites while preserving important information and optionally gathering links.

Tools

jina_read_url

Extract and process web content into clean, LLM-friendly text using Jina AI Reader. Supports advanced content parsing, link gathering, and multiple output formats with configurable processing options.

Input

ParameterTypeRequiredDescription
urlstringYesThe URL to read and convert to markdown
useReaderLMv2booleanNoWhether to use ReaderLM-v2 for better quality
gatherLinksbooleanNoWhether to gather all links at the end
jsonResponsebooleanNoWhether to return response in JSON format
apiKeystringYesYour Jina AI API key

Output

ParameterType
contentstring

Block Configuration

Input

ParameterTypeRequiredDescription
urlstringYesURL - Enter URL to extract content from

Outputs

OutputTypeDescription
contentstringcontent output from the block

Notes

  • Category: tools
  • Type: jina
Jina