Channel #semsol: Logs

This is a public chat log generated from the #semsol IRC channel.

14:08:50 vicman: can anyone help me with a question please?
14:11:24 vicman: is there a way to parse RDFa from a text string instead of a web URL with ARC?
14:12:12 bengee: yes
14:13:05 bengee: when you look at the example at http://arc.semsol.org/docs/v2/extractors
14:13:49 bengee: you can do $parser->parse('http://example.com/', $your_html_snippet)
14:14:13 bengee: where $your_html_snippet is the data as a string variable
14:14:32 vicman: and what's the url?
14:14:51 bengee: that's the graph (or base) URL used for relative paths
14:15:16 bengee: you can set it to anything you want
14:15:48 bengee: if you don't have relative URL values in your HTML, it will have no effect
14:16:09 vicman: amazing
14:16:17 vicman: that is what i need
14:16:32 vicman: thank you very much
14:17:36 bengee: I hope I remember it correctly ;)
14:18:12 vicman: :)
14:19:16 bengee: ah, one note: $your_html_snippet must be a single node
14:19:55 bengee: i.e. either a complete <html>...</html> or some other single-root-node markup
14:20:41 bengee: e.g. in case of "<div>...</div><span>...</span>" only the first node (i.e. the div) will be parsed
14:21:14 vicman: wow this is an important note
14:21:35 bengee: if you have sibling nodes, just wrap them in a container div
14:21:36 vicman: that can avoid a few headaches
14:21:55 vicman: thanks again
14:22:00 bengee: np
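
Editor's note: the advice in this log can be sketched roughly as follows. This is a minimal, untested sketch, not bengee's own code. It assumes ARC2 is already loaded (for example via include_once on your local ARC2.php) and that the SemHTML parser calls match the extractors documentation linked above. The helper names wrap_in_container and parse_rdfa_from_string, the sample snippet, and the base URL are hypothetical and used only for illustration.

```php
<?php
// Assumption: ARC2 has been loaded beforehand, e.g. include_once('path/to/arc/ARC2.php');

// Wrap a fragment in a container div so the parser sees a single root node.
// Without this, only the first of several sibling nodes would be parsed.
function wrap_in_container($html_snippet) {
    return '<div>' . $html_snippet . '</div>';
}

// Parse RDFa from a string instead of fetching a URL.
// $base_url is only used to resolve relative URLs found in the markup.
function parse_rdfa_from_string($html_snippet, $base_url = 'http://example.com/') {
    $parser = ARC2::getSemHTMLParser(array('auto_extract' => 0));
    $parser->parse($base_url, wrap_in_container($html_snippet));
    $parser->extractRDF('rdfa');
    return $parser->getTriples();
}

// Hypothetical input with two sibling root nodes.
$snippet = '<div xmlns:foaf="http://xmlns.com/foaf/0.1/" about="#me">'
         . '<span property="foaf:name">Alice</span></div>'
         . '<span>more markup</span>';

// Only call into ARC2 when the library is actually available.
if (class_exists('ARC2')) {
    print_r(parse_rdfa_from_string($snippet));
}
```

Wrapping unconditionally keeps the helper simple. A snippet that is already a complete <html>...</html> document already has a single root, so pass it to parse() directly rather than wrapping it.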