Scrape Tool
Extracts text or attribute values from the current page using a selector (and optional attribute), returning the matched values.
Arguments
Name | Type | Description |
---|---|---|
task | DownloadTask | Required. A task from the previous Start or Crawl tool response |
selector | String | Required. Selector that identifies the elements to extract from the page, in the format described under Remarks |
attributeName | String | Optional. Attribute name to get data from. Use val to get inner text. Default value: val |
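
The argument table can be illustrated with a small helper that assembles the three arguments and applies the documented default. This is a minimal sketch only: `build_scrape_arguments` is a hypothetical name, and how the resulting dictionary is actually passed to the tool is not described here, so that part is left out.

```python
from typing import Any, Dict, Optional


def build_scrape_arguments(
    task: Dict[str, str],
    selector: str,
    attribute_name: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble Scrape tool arguments (hypothetical helper, not part of the tool)."""
    # task is required and must carry the two DownloadTask fields.
    if not task.get("Id") or not task.get("Url"):
        raise ValueError("task must contain non-empty 'Id' and 'Url'")
    # selector is required.
    if not selector:
        raise ValueError("selector is required")
    return {
        "task": task,
        "selector": selector,
        # attributeName is optional; the documented default is "val" (inner text).
        "attributeName": attribute_name or "val",
    }
```

For example, calling the helper with the selector `CSS: a` and the attribute name `href` asks for link targets, while omitting the attribute name falls back to `val`.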
Remarks
The selector argument has the following format: CSS|XPATH: selector
The first part defines the selector type; the second part is a selector expression written in that type's syntax, for example CSS: div.title or XPATH: //div[@class='title'].
Supported types: CSS, XPATH
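
The selector format in the remarks above can be checked on the client side before a request is made. The following is a sketch under stated assumptions: `split_selector` is a hypothetical helper, and the case-insensitive handling of the type prefix is a convenience of this sketch, not documented behavior of the tool.

```python
from typing import Tuple

# Selector types named in the format "CSS|XPATH: selector".
SUPPORTED_TYPES = ("CSS", "XPATH")


def split_selector(value: str) -> Tuple[str, str]:
    """Split 'TYPE: expression' into (type, expression). Hypothetical helper."""
    # Split on the first colon only, so colons inside the expression
    # (for example a CSS pseudo-class such as a:hover) are preserved.
    prefix, separator, expression = value.partition(":")
    selector_type = prefix.strip().upper()  # assumption: prefix case is normalized here
    expression = expression.strip()
    if not separator or selector_type not in SUPPORTED_TYPES or not expression:
        raise ValueError(f"expected 'CSS: ...' or 'XPATH: ...', got {value!r}")
    return selector_type, expression
```

Splitting on the first colon rather than on every colon matters because both CSS and XPath expressions can themselves contain colons.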
DownloadTask
Represents a single page download request, as returned in a Start or Crawl tool response.
Fields:
Name | Type | Description |
---|---|---|
Id | String | Required. Task Id |
Url | String | Required. Page URL |
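
On the client side, the two fields above might be modeled as a small typed record. This is an illustrative sketch, not the tool's own type: the class and `from_mapping` are hypothetical, and the assumption is that a task arrives as a key/value mapping with exactly the field names listed in the table.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class DownloadTask:
    """Client-side mirror of the DownloadTask fields (illustrative only)."""

    Id: str   # Required. Task Id
    Url: str  # Required. Page URL

    @classmethod
    def from_mapping(cls, data: Mapping[str, str]) -> "DownloadTask":
        # Both fields are required, so reject missing or empty values.
        missing = [name for name in ("Id", "Url") if not data.get(name)]
        if missing:
            raise ValueError(f"missing required field(s): {', '.join(missing)}")
        return cls(Id=data["Id"], Url=data["Url"])
```

The record is frozen because a task is meant to be passed along unchanged from a Start or Crawl response to the Scrape tool.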
Return Type
Array of String