HTML Extract
processing.html-extract Processing v0.1.0 Extracts values from an HTML string with CSS selectors — text, inner HTML, or an attribute per rule, first match or all matches. Pair with HTTP Request to scrape a page (this node does not fetch).
Ports
| Direction | Port | Label |
|---|---|---|
| Input | input | HTML |
| Output | output | Extracted |
Properties
| Property | Type | Default | Description |
|---|---|---|---|
Source fieldsourceField | string | "" | Dot-path to the HTML string on the input. Leave blank when the payload itself is the HTML. |
Extractionsextractions | rows | [{"field":"","selector":"","attr":"","multiple":false}] | One output key per row: a CSS selector (tag, *, #id, .class, [attr], [attr=v], ^= $= *=, compounds, descendant, >, comma groups, :first-of-type, :nth-of-type(n)), what to take (text, html, or an attribute name), and first match or all matches. |
Trim valuestrim | boolean | true | Trim whitespace around extracted values. |
This page is generated from the node registry by gen-node-docs.mjs on every
site build — ports, properties, defaults and visibility rules cannot drift from the code.