We're thrilled to announce the release of our official LangChain integration! The new webcrawlerapi-langchain
package makes it seamless to incorporate WebcrawlerAPI's powerful web crawling capabilities into your LangChain document processing pipelines.
Key Features:
- 🚀 Simple integration with LangChain's document loaders
- 📄 Multiple content formats (markdown, cleaned text, HTML)
- ⚡️ Async and lazy loading support
- 🔄 Built-in retry mechanisms, proxies and error handling
- 🎯 Configurable URL filtering with regex patterns
Quick Start:
pip install webcrawlerapi-langchain
from webcrawlerapi_langchain import WebCrawlerAPILoader
loader = WebCrawlerAPILoader(
url="https://example.com",
api_key="your-api-key",
scrape_type="markdown"
)
documents = loader.load()
Perfect for:
- Building AI-powered knowledge bases
- Creating document QA systems
- Training custom language models
- Processing web content for LLM applications
Need an integration example?
Check our WebcrawlerAPI examples
Check out our LangChain SDK documentation for detailed usage instructions and examples. Start building powerful AI applications with web data today!