CrawlMdrConfig Tools

Build and update multi‑level crawl/scrape plans: define tree structure, link selectors, and field extraction rules for complex extractions.

Each tool returns a new or modified CrawlMdrConfig object. The returned CrawlMdrConfig object is passed to the next tool call as a required input parameter.

CrawlMdrConfigCreate

Creates a new empty CrawlMdrConfig object with path /.

Arguments

None

CrawlMdrConfigUpsertSub

Adds or updates a child level and the transition crawl parameters to reach it.

Remarks

The selector argument is a selector of the following format: CSS|XPATH: selector. The first part defines the selector type, the second one should be a selector in the corresponding type. Supported types:

Arguments

Name Type Description
crawlMdrConfig Object Required. MDR configuration object from the previous tool call
path String Required. Path to a level in the MDR tree. It should start with / and contain at least one step. Each step is separated by /. Path must not end with /
selector String Required. Selector for getting interesting links on a web page
attributeName String Optional. Attribute name to get data from. Use val to get inner text. Default value: href

CrawlMdrConfigUpsertCrawlParams

Adds or updates link selectors for a specific MDR level.

Arguments

Name Type Description
crawlMdrConfig Object Required. MDR configuration object from the previous tool call
path String Required. Path to a level in the MDR tree. It should start with / and contain at least one step. Each step is separated by /. Path must not end with /
selector String Required. Selector for getting interesting links on a web page
attributeName String Optional. Attribute name to get data from. Use val to get inner text. Default value: href

CrawlMdrConfigUpsertScrapeParams

Adds or updates a field’s selector/attribute for a specific MDR level.

Arguments

Name Type Description
crawlMdrConfig Object Required. MDR configuration object from the previous tool call
path String Required. Path to a level in the MDR tree. It should start with / and contain at least one step. Each step is separated by /. Path must not end with /
fieldName String Required. Name of a data field that will contain scraped data according to the provided selector and attribute name.
selector String Required. Selector for getting interesting data on a web page
attributeName String Optional. Attribute name to get data from. Use val to get inner text. Default value: val

Please rotate your device to landscape mode

This documentation is specifically designed with a wider layout to provide a better reading experience for code examples, tables, and diagrams.
Rotating your device horizontally ensures you can see everything clearly without excessive scrolling or resizing.

Return to Web Data Source Home