Type Definition
Options Reference
Required Options
| Option | Type | Description |
|---|---|---|
urls | string[] | Array of URLs to scrape |
Output Options
| Option | Type | Default | Description |
|---|---|---|---|
formats | Array<"markdown" | "html"> | ["markdown"] | Output formats to include |
Content Extraction Options
| Option | Type | Default | Description |
|---|---|---|---|
onlyMainContent | boolean | true | Extract only main content, removing nav/header/footer |
includeTags | string[] | [] | CSS selectors for elements to keep |
excludeTags | string[] | [] | CSS selectors for elements to remove |
removeAds | boolean | true | Remove ad and tracking elements |
removeBase64Images | boolean | true | Remove base64-encoded images |
Request Configuration Options
| Option | Type | Default | Description |
|---|---|---|---|
userAgent | string | undefined | Custom user agent string |
headers | Record<string, string> | undefined | Custom headers for requests |
timeoutMs | number | 30000 | Request timeout per page (ms) |
waitForSelector | string | undefined | CSS selector to wait for before extraction |
skipTLSVerification | boolean | true | Skip TLS/SSL certificate verification |
URL Filtering Options
| Option | Type | Default | Description |
|---|---|---|---|
includePatterns | string[] | [] | URL patterns to include (regex) |
excludePatterns | string[] | [] | URL patterns to exclude (regex) |
Batch Processing Options
| Option | Type | Default | Description |
|---|---|---|---|
batchConcurrency | number | 1 | URLs to process in parallel |
batchTimeoutMs | number | 300000 | Timeout for entire batch (ms) |
maxRetries | number | 2 | Max retry attempts for failed URLs |
onProgress | function | undefined | Progress callback |
Proxy Options
| Option | Type | Default | Description |
|---|---|---|---|
proxy | ProxyConfig | undefined | Proxy configuration |
Debugging Options
| Option | Type | Default | Description |
|---|---|---|---|
verbose | boolean | false | Enable verbose logging |
showChrome | boolean | false | Show browser window |

