🦜🔗 Introducing WebcrawlerAPI LangChain Integration 🤖

We're thrilled to announce the release of our official LangChain integration! The new webcrawlerapi-langchain package makes it seamless to incorporate WebcrawlerAPI's powerful web crawling capabilities into your LangChain document processing pipelines.

Key Features:

  • 🚀 Simple integration with LangChain's document loaders
  • 📄 Multiple content formats (markdown, cleaned text, HTML)
  • ⚡️ Async and lazy loading support
  • 🔄 Built-in retry mechanisms, proxies and error handling
  • 🎯 Configurable URL filtering with regex patterns
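To make the URL filtering idea concrete, here is a minimal sketch of how allow/deny regex patterns can narrow a list of discovered URLs. It uses only Python's standard `re` module. The function name `filter_urls` and the `allow` and `deny` parameters are hypothetical and chosen for illustration; they are not the package's actual option names, so check the SDK documentation for the real ones.

```python
import re
from typing import Iterable, List, Optional


def filter_urls(
    urls: Iterable[str],
    allow: Optional[str] = None,
    deny: Optional[str] = None,
) -> List[str]:
    """Keep URLs that match `allow` (if given) and do not match `deny` (if given).

    Hypothetical helper for illustration only, not part of webcrawlerapi-langchain.
    """
    allow_re = re.compile(allow) if allow else None
    deny_re = re.compile(deny) if deny else None

    kept = []
    for url in urls:
        # Skip URLs that fail the allow pattern.
        if allow_re and not allow_re.search(url):
            continue
        # Skip URLs that hit the deny pattern.
        if deny_re and deny_re.search(url):
            continue
        kept.append(url)
    return kept


urls = [
    "https://example.com/blog/post-1",
    "https://example.com/blog/tag/python",
    "https://example.com/pricing",
]

# Keep blog pages, but drop tag listing pages.
selected = filter_urls(urls, allow=r"/blog/", deny=r"/tag/")
print(selected)  # ['https://example.com/blog/post-1']
```

Applying the deny pattern after the allow pattern means an exclusion always wins, which is usually the safer default when crawling a site with many near-duplicate listing pages.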

Quick Start:

pip install webcrawlerapi-langchain

from webcrawlerapi_langchain import WebCrawlerAPILoader

loader = WebCrawlerAPILoader(
    url="https://example.com",
    api_key="your-api-key",
    scrape_type="markdown"
)
documents = loader.load()

Perfect for:

  • Building AI-powered knowledge bases
  • Creating document QA systems
  • Training custom language models
  • Processing web content for LLM applications

Need an integration example?

Check out our WebcrawlerAPI examples.

Check out our LangChain SDK documentation for detailed usage instructions and examples. Start building powerful AI applications with web data today!