April 13, 2025

🦜🔗 Introducing WebcrawlerAPI LangChain Integration 🤖

We're thrilled to announce the release of our official LangChain integration! The new webcrawlerapi-langchain package exposes WebcrawlerAPI as a LangChain document loader, so crawled pages can flow straight into your LangChain document processing pipelines.

Key Features:

🚀 Simple integration with LangChain's document loaders
📄 Multiple content formats (markdown, cleaned text, HTML)
⚡️ Async and lazy loading support
🔄 Built-in retries, proxy support, and error handling
🎯 Configurable URL filtering with regex patterns
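
To give a feel for what regex-based URL filtering means in practice, here is a minimal standalone sketch using only Python's standard `re` module. The `filter_urls` helper and its `allow` / `deny` parameters are hypothetical names chosen for illustration; they are not the package's actual options, so check the SDK documentation for the real parameter names.

```python
import re
from typing import Iterable, List, Optional


def filter_urls(
    urls: Iterable[str],
    allow: Optional[str] = None,
    deny: Optional[str] = None,
) -> List[str]:
    """Keep URLs that match `allow` (if given) and do not match `deny` (if given).

    Hypothetical helper for illustration only; not part of webcrawlerapi-langchain.
    """
    allow_re = re.compile(allow) if allow else None
    deny_re = re.compile(deny) if deny else None
    kept = []
    for url in urls:
        if allow_re and not allow_re.search(url):
            continue  # not on the allow list
        if deny_re and deny_re.search(url):
            continue  # explicitly excluded
        kept.append(url)
    return kept


urls = [
    "https://example.com/blog/post-1",
    "https://example.com/blog/tag/python",
    "https://example.com/pricing",
]

# Keep blog pages, but skip tag listing pages.
blog_posts = filter_urls(urls, allow=r"/blog/", deny=r"/blog/tag/")
print(blog_posts)
```

An allow pattern narrows the crawl to the section you care about, while a deny pattern drops noisy pages such as tag or archive listings before they reach your pipeline.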

Quick Start:

pip install webcrawlerapi-langchain

from webcrawlerapi_langchain import WebCrawlerAPILoader

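loader = WebCrawlerAPILoader(
    url="https://example.com",   # start URL for the crawl
    api_key="your-api-key",      # replace with your WebcrawlerAPI key
    scrape_type="markdown",      # content format to return
)
# Run the crawl and collect the results as LangChain documents
documents = loader.load()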
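
The difference between eager and lazy loading can be sketched without any network access. The example below assumes the loader follows LangChain's usual loader convention, where `load()` returns a list and `lazy_load()` yields documents one at a time. `Document` and `StubLoader` here are stand-ins written for this illustration (they serve canned pages and are not the real classes), so treat it as a sketch of the pattern rather than the package's implementation.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List


@dataclass
class Document:
    """Stand-in for LangChain's Document: text plus metadata."""
    page_content: str
    metadata: Dict[str, str] = field(default_factory=dict)


class StubLoader:
    """Hypothetical loader that serves canned pages instead of crawling."""

    def __init__(self, pages: Dict[str, str]) -> None:
        self.pages = pages

    def lazy_load(self) -> Iterator[Document]:
        # Yield one document at a time so the caller never has to
        # hold the whole crawl in memory.
        for url, text in self.pages.items():
            yield Document(page_content=text, metadata={"source": url})

    def load(self) -> List[Document]:
        # Eager variant: materialize everything lazy_load() yields.
        return list(self.lazy_load())


stub_loader = StubLoader({
    "https://example.com/": "# Home",
    "https://example.com/docs": "# Docs\n\nGetting started...",
})

# Stream documents and process each one as it arrives.
sources = [doc.metadata["source"] for doc in stub_loader.lazy_load()]
print(sources)
```

Lazy iteration is the better fit for large sites, since each page can be chunked or embedded as soon as it arrives; `load()` is simpler when the crawl is small enough to hold in memory.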

Perfect for:

Building AI-powered knowledge bases
Creating document QA systems
Training custom language models
Processing web content for LLM applications

Need an integration example? See our WebcrawlerAPI examples.

Check out our LangChain SDK documentation for detailed usage instructions and examples. Start building powerful AI applications with web data today!