n8n-nodes-cheerio
An n8n community node for parsing and extracting data from HTML using Cheerio.
What is n8n?
n8n is a fair-code licensed workflow automation platform that allows you to connect various services and automate tasks.
Installation
Follow the installation guide in the n8n documentation to install this community node.
Operations
The Cheerio node allows you to:
- Extract text content using CSS selectors
- Extract HTML content using CSS selectors
- Extract element attributes
- Remove unwanted elements before parsing
- Process multiple selectors in a single operation
- Return single items or arrays of matched elements
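The trimming and single-item behaviours listed above can be pictured as a small post-processing step applied to whatever a selector matched. The sketch below is not the node's source code. It assumes the matched text values have already been pulled out of the HTML, and the names `MatchOptions`, `trimText`, and `shapeMatches` are hypothetical (only `singleItem` appears in this README's usage example). Returning `null` for an empty single-item match is also an assumption.

```typescript
// Hypothetical option names; only `singleItem` appears in the usage example.
interface MatchOptions {
  trimText?: boolean;   // strip leading/trailing whitespace from each value
  singleItem?: boolean; // keep only the first match instead of an array
}

// Shape the raw values matched by one selector according to its options.
function shapeMatches(
  matches: string[],
  options: MatchOptions = {},
): string | string[] | null {
  const values = options.trimText
    ? matches.map((value) => value.trim())
    : [...matches];
  if (options.singleItem) {
    // Assumption: an empty match yields null rather than an empty string.
    return values[0] ?? null;
  }
  return values;
}

const rawMatches = ["  Product Name \n", " Other Product "];
// First match only, trimmed: the string "Product Name"
console.log(shapeMatches(rawMatches, { trimText: true, singleItem: true }));
// All matches, trimmed: the array ["Product Name", "Other Product"]
console.log(shapeMatches(rawMatches, { trimText: true }));
```

Keeping the shaping separate from the HTML lookup makes it easy to see why a single-item selector yields a plain value while the default yields an array.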
Configuration
Inputs
- HTML: The HTML content to parse
- Selectors: One or more selector configurations, each with:
  – Name: Identifier for the extracted data
  – CSS Selector: Standard CSS selector syntax
  – Attribute: (Optional) Specific attribute to extract
  – Trim Text: Remove whitespace from results
  – Return Single Item: Get only the first match
  – Return HTML: Get HTML instead of text content
- Remove Elements: (Optional) CSS selectors for elements to remove before parsing
Output
Returns a JSON object with:
- results: Extracted data organized by selector names
- totalElements: Count of all matched elements
- selectors: Number of selectors processed
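To make the output shape (`results`, `totalElements`, `selectors`) concrete, here is a minimal sketch of how such an object could be assembled once each selector's matches are known. It is an illustration, not the node's implementation: `SelectorMatches`, `CheerioNodeOutput`, and `buildOutput` are hypothetical names, and it assumes `totalElements` counts every matched element, including ones not returned by a single-item selector.

```typescript
// Hypothetical input: what each named selector matched.
interface SelectorMatches {
  name: string;         // key under which the data appears in `results`
  values: string[];     // extracted values, one per matched element
  singleItem?: boolean; // return only the first value when true
}

// Output fields as described in this README.
interface CheerioNodeOutput {
  results: Record<string, string | string[] | null>;
  totalElements: number;
  selectors: number;
}

function buildOutput(matches: SelectorMatches[]): CheerioNodeOutput {
  const results: Record<string, string | string[] | null> = {};
  let totalElements = 0;
  for (const match of matches) {
    // Assumption: count all matched elements, even if only the first is returned.
    totalElements += match.values.length;
    results[match.name] = match.singleItem
      ? (match.values[0] ?? null)
      : match.values;
  }
  return { results, totalElements, selectors: matches.length };
}

const output = buildOutput([
  { name: "productName", values: ["Product Name"], singleItem: true },
  { name: "price", values: ["$99"], singleItem: true },
]);
console.log(JSON.stringify(output));
// {"results":{"productName":"Product Name","price":"$99"},"totalElements":2,"selectors":2}
```

Because `results` is keyed by selector name, two selectors sharing a name would overwrite each other in this sketch, so distinct names are the safer choice.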
Usage Example
{
  "html": "<h2>Product Name</h2><span class=\"price\">$99</span>",
  "selectors": [
    {
      "name": "productName",
      "selector": "h2",
      "singleItem": true
    },
    {
      "name": "price",
      "selector": ".price",
      "singleItem": true
    }
  ]
}
Result:
{
  "results": {
    "productName": "Product Name",
    "price": "$99"
  },
  "totalElements": 2,
  "selectors": 2
}
Resources
Development
Install dependencies:
npm install
Build the node:
npm run build
Run tests:
npm test