docs
Actions
What is action?

Actions

Actions allow you to specify post-processing operations that should be performed on your crawled data. They enable you to automate the entire workflow from crawling to storage or further processing.

Available Actions

Currently, the following actions are supported:

  • S3 Upload: Upload the crawled data directly to Amazon S3 or any S3-compatible storage service.

Usage

To use actions, include an actions array in your crawl request. Each action is an object with a type field that specifies the action type, along with action-specific parameters.

{
  "url": "https://example.com",
  "scrape_type": "cleaned",
  "actions": [
    {
      "type": "upload_s3",
      // action-specific parameters
    }
  ]
}

Multiple Actions

You can specify multiple actions to be performed sequentially. For example, you could upload the data to multiple storage locations.

{
  "url": "https://example.com",
  "scrape_type": "cleaned",
  "actions": [
    {
      "type": "upload_s3",
      // S3 parameters
    },
    {
      "type": "another_action",
      // parameters for another action
    }
  ]
}

Action Results

For details on specific actions and their parameters, refer to the individual action documentation pages.