Channel #semsol: Logs

This is a public chat log generated from the #semsol IRC channel.

11:28:08 danbri: bengee, i posted the loader script I'm using: http://danbri.org/words/2008/01/03/243
11:30:17 bengee: oh, cool
11:30:34 bengee: I have that wordpress plugin almost working, too
11:30:52 bengee: and figured out how to define sidebar widgets
11:33:05 danbri: i
11:33:21 danbri: plugins seem to be interacting and screwing things up, including somehow my feeds for planetrdf
11:33:27 danbri: but i'll be back :)
11:33:54 danbri: i wanted to ask about your smushing stuff: does it change the graph(s), or leave untouched?
11:34:33 bengee: it changes the internal identifiers
11:35:37 bengee: i.e. it doesn't directly change the graph data, but queries return different results after smushing
11:36:52 bengee: values are not lost
11:37:08 bengee: hmm, that's a bit tricky to explain..
11:37:42 bengee: there is a table that maps values (iris, literals, bnode ids) to an internal ID
11:38:09 danbri: so it's like it's inferring a private graph of sameAs relations?
11:38:25 danbri: the ID changing being your implementation trick for recording that...?
11:38:30 bengee: those mapping tables have (val, id, cid) where cid is "consolidated id"
11:39:24 bengee: the sql rewriter uses "cid" instead of "id", which are the same initially, but may be different after smushing
11:39:28 danbri: can you back out after smushing on dodgy data?
11:40:18 bengee: this allows you to keep using the URI or bnode ids you are used to in your queries, but still let ARC use consolidated IDs internally
11:40:55 danbri: that's cool
11:42:19 bengee: the undoConsolidation() method is still missing, I just discovered...
11:42:58 bengee: it'd just be a SET cid = id, I should add that..
11:43:23 bengee: you can currently reset the cids by re-loading a graph
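The id/cid mapping described above can be illustrated with a minimal sketch. It uses Python and an in-memory SQLite table as stand-ins; the table name `value_map` and the helpers `smush`, `undo_consolidation`, and `cids` are hypothetical and are not ARC's actual schema or API. The only parts taken from the discussion are the (val, id, cid) columns and the `SET cid = id` reset.

```python
import sqlite3

# Hypothetical mapping table: one row per value (IRI, literal, or bnode id).
# "id" is the original internal ID; "cid" is the consolidated ID that a
# query rewriter would join on. Both start out equal.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE value_map (val TEXT PRIMARY KEY, id INTEGER, cid INTEGER)")
conn.executemany(
    "INSERT INTO value_map VALUES (?, ?, ?)",
    [("_:b1", 1, 1), ("_:b2", 2, 2), ("http://example.org/alice", 3, 3)],
)

def smush(conn, keep_val, merge_val):
    """Give merge_val the same cid as keep_val; val and id stay untouched."""
    (cid,) = conn.execute(
        "SELECT cid FROM value_map WHERE val = ?", (keep_val,)
    ).fetchone()
    conn.execute("UPDATE value_map SET cid = ? WHERE val = ?", (cid, merge_val))

def undo_consolidation(conn):
    """Reset every consolidated ID to the original ID."""
    conn.execute("UPDATE value_map SET cid = id")

def cids(conn):
    """Return a {val: cid} snapshot of the mapping table."""
    return dict(conn.execute("SELECT val, cid FROM value_map ORDER BY id"))

smush(conn, "_:b1", "_:b2")      # treat the two bnodes as the same resource
after_smush = cids(conn)
undo_consolidation(conn)         # back out of a smush on dodgy data
after_undo = cids(conn)
```

Because only the cid column changes, the original values and IDs remain available, which is why a reset is a single UPDATE in this sketch.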
11:43:58 danbri: slightly related in #jena just now, Andy just said:
11:43:59 danbri: AndyS: Blank nodes are going to be tricky. Mulgara does not provide stable blank nodes across transactions. Writing an RDF editor is "challenging". Or don
11:44:10 danbri: talking about the bridge to mulgara store they're working on
11:44:24 bengee: ah
11:44:33 danbri: what commitments does arc2 make re bnode id longevity?
11:45:00 bengee: that id/cid trick allowed me to implement bnode editing in that editing thingy
11:45:26 bengee: bnode ids are stable as long as the graph is not re-loaded
11:45:28 danbri: if the smush goes wrong, cid can just be restored to id?
11:45:36 bengee: yes
11:45:41 bengee: (I hope) ;)
11:45:54 danbri: cases where smush goes wrong: people using group weblog URIs in foaf:weblog
11:46:10 danbri: or if we'd made foaf:img inverse functional, and lots of sites had a default image url that ended up in it
11:46:13 danbri: etc
11:46:25 bengee: the inferencer is still under construction
11:47:31 bengee: something is borken, it used to work nicely on twitter, but I tried recently, and the results loked wrong
11:47:34 bengee: looked
11:48:03 danbri: do you have twitter buddylist dumps?
11:48:33 bengee: no dumps
11:48:47 danbri: or service or mapping or whatever...
11:48:52 danbri: ok what are you talking about? :)
11:49:26 bengee: ARC supports LOAD <http://twitter.com/danbri>
11:49:43 bengee: it'll auto-detect the format and extract xfn and hcard as RDF
11:49:52 danbri: ah :)
11:51:15 danbri: danbri does
11:51:20 danbri: [danbri@ip-10-251-46-68 foaf]$ ./dataloader http://twitter.com/danbri
11:51:20 danbri: Loading data from http://twitter.com/danbri
11:51:37 bengee: bengee ducks
11:51:57 bengee: it might break your multi-graph query due to join overload...
11:53:06 danbri: how would it break it?
11:53:09 danbri: btw doing
11:53:09 danbri: SELECT DISTINCT ?p WHERE { GRAPH <http://twitter.com/danbri> { ?s ?p ?o } }
11:53:21 danbri: ... I don't see anything like knows/friend/follows
11:54:11 bengee: there should be xfn:contact
11:55:11 danbri: not seeing it in http://sandbox.foaf-project.org/2008/foaf/ggg.php
11:56:15 bengee: the last two are xfn:me and xfn:contact here
11:57:52 danbri: doh
11:57:55 danbri: thanks
11:59:40 danbri: do we have any handy sources of xfn data where more exotic relationship types might be used?
11:59:54 danbri: i wanted to play with ways of defining a foaf:Group by relationship types
11:59:56 bengee: oh, the inferencer does more updates than just the cid column
12:00:01 danbri: eg. workplaceHomepage of www.w3.org
12:00:08 danbri: or xfn or family tree relations
12:02:04 bengee: something on http://microformats.org/wiki/hcard-xfn-friends-lists maybe?
12:03:40 bengee: but the services probably just use the standard props
12:09:23 danbri: i guess wordpress has built-in support, i could do some of my own friends?
12:09:49 danbri: have you ever looked at d2rq-ish stuff? ie wrapping sparql around vanilla sql tables
12:10:13 danbri: reason i ask is issue of crawling from wordpress (sql) into wordpress (rdf sql) ... some scope for redundancy
12:10:48 bengee: not too much, although it's interesting stuff
12:10:51 danbri: ie. a wordpress installation could be serialized out as a lot of RDF, and then loaded back into an ARC-augmented wordpress (possibly the same one, or possibly one of a neighbour, colleague...)
12:11:13 danbri: ...whereas if we had a d2rq'y thing, maybe can minimise that in the case where it would be stored back in itself
12:11:19 danbri: premature optimisation tho :)
12:11:27 danbri: it's cool technology
12:14:04 bengee: there is an r2d2 for rap, iirc
12:18:44 danbri: oh, interesting, i'd forgotten that
12:18:49 danbri: wonder how tightly coupled it is
13:25:05 bengee: bengee wonders how/where to make the endpoint available in the wordpress plugin
13:26:13 bengee: doesn't seem to be easy to create a dynamic page outside of wp-admin
13:45:49 bengee: ah, "template_redirect"
13:54:43 danbri: re the endpoint, ... could you make the default html form selection be "html table" not "default"?
13:55:17 bengee: html table is only available for selects
13:55:25 danbri: ah fair enough
13:55:33 danbri: also - is it possible to use GET instead of POST as an option? I guess so...?
13:55:47 bengee: the endpoint form is really just a very basic tool
13:55:58 bengee: the first version didn't even have the format selector
13:56:04 danbri: and re getting JSON results ... is there an arg for wrapping the json in a function, so it can be used in remote script tags?
13:56:20 bengee: GET works for the read-only calls
13:57:18 bengee: the format arg is "format", alternatively, you can use conneg
13:57:35 bengee: e.g. &format=json
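A read-only GET request of the kind discussed here could be built as in the following sketch. The `query` parameter is the standard SPARQL protocol parameter and `format` is the argument named above; the endpoint URL `http://example.org/sparql` and the helper `build_select_url` are placeholders, not the real endpoint or part of ARC.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

def build_select_url(endpoint, query, fmt="json"):
    """Build a GET URL for a read-only query, percent-encoding the SPARQL text."""
    return endpoint + "?" + urlencode({"query": query, "format": fmt})

QUERY = "SELECT DISTINCT ?p WHERE { GRAPH <http://twitter.com/danbri> { ?s ?p ?o } }"

# Placeholder endpoint; substitute the URL of an actual SPARQL endpoint.
url = build_select_url("http://example.org/sparql", QUERY)

# Round-trip check: decoding the query string gives back the original values.
decoded = parse_qs(urlsplit(url).query)
```

Content negotiation is the alternative mentioned above: the same URL without `format`, sent with a suitable Accept header.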
13:57:43 danbri: but that returns pure json
13:57:59 bengee: ah, you mean something like jsonp
13:58:05 bengee: sorry, misunderstood
13:58:18 danbri: so random pages can load it via <script
13:58:32 danbri: ...it needs my_callback( json_goes_here ) ...
13:58:37 bengee: looks like I removed that functionality, hmm...
13:59:02 bengee: I had that in ARC2, including indexed json and stuff
13:59:06 bengee: er in ARC1
14:00:03 bengee: should be easy to re-add
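The callback wrapping asked for here is what is usually called JSONP. A minimal sketch, assuming a helper named `to_jsonp` (hypothetical, not an ARC function): it serializes the result and wraps it in a call to the requested function, after checking that the callback name is a plain dotted identifier so that the parameter cannot inject arbitrary script.

```python
import json
import re

# Accept only identifiers such as "my_callback" or "app.handlers.onData".
_CALLBACK_RE = re.compile(r"[A-Za-z_$][A-Za-z0-9_$]*(\.[A-Za-z_$][A-Za-z0-9_$]*)*")

def to_jsonp(result, callback):
    """Wrap a JSON-serializable result as callback(<json>); for <script> loading."""
    if not _CALLBACK_RE.fullmatch(callback):
        raise ValueError("unsafe callback name: %r" % callback)
    return "%s(%s);" % (callback, json.dumps(result))

wrapped = to_jsonp({"a": 1}, "my_callback")
```

A page on another site could then load the endpoint URL in a script tag and receive the results through its own `my_callback` function.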
14:01:52 mortenf: mortenf waves
14:02:07 bengee: mortenf!
14:02:20 mortenf: just downloaded arc2 preview
14:02:35 mortenf: saw you're into wordpress stuff atm :)
14:02:39 bengee: I hope it works with your mysql now
14:03:02 mortenf: mortenf tries to remember...
14:03:28 bengee: ARC1 was 4.0.18 or higher, you had something earlier
14:03:39 mortenf: right
14:03:47 mortenf: i think i fixed it
14:03:56 mortenf: but great, did see the req.
14:03:57 bengee: I added version detection now to make it work
14:04:00 mortenf: :)
14:04:14 danbri: morten!
14:04:20 mortenf: mortenf ought to get around to getting a new box and do ubuntu...
14:04:24 mortenf: hey dan
14:04:40 bengee: and I blame danbri for the wordpress(ure)
14:04:49 danbri: sorry :)
14:04:50 mortenf: heh, good job dan!
14:05:01 danbri: re new box ... i've been playing with ec2 this week
14:05:06 danbri: as many new boxes as you want!
14:05:11 mortenf: yeah, saw that
14:05:24 mortenf: right, not sure of the implications though
14:05:27 danbri: i've spent about 6 usd so far
14:05:51 danbri: implications -- yeah, the service is relatively immature
14:05:57 danbri: but i like the way things are going
14:06:07 danbri: and much more flexibility than most hosting companies
14:06:15 mortenf: complicated: money, access, performance etc.
14:07:16 danbri: access? if things go wrong you mean?
14:07:18 mortenf: mortenf is about to write an arc2 version of http://www.wasab.dk/morten/blog/archives/2005/01/04/exchange-of-named-rdf-graphs
14:07:19 bengee: yay, I got that endpoint working via WP
14:07:28 mortenf: yeah, mostly, i guess
14:07:45 mortenf: also, of course, there's the hw geeky factor :)
14:07:47 danbri: oh, cool idea
14:07:49 mortenf: yay!
14:08:14 danbri: i've been bugging libby to get her old codepiction data back online
14:08:22 mortenf: but maybe i'm just old fashioned
14:08:36 danbri: you like to own your own server?
14:08:46 mortenf: i think :)
14:09:07 mortenf: of course, just spent holidays replacing failed drive, so...
14:09:35 mortenf: raid1d rules...
14:09:56 bengee: named graph exchange sounds great
14:10:26 mortenf: would make it easier for me to test (aka stress) arc2 too :)
14:10:37 bengee: oh, shit
14:10:41 bengee: ;)
14:10:42 mortenf: heh
14:11:01 danbri: I need to buy some raid kit
14:11:12 danbri: i have a mountain of corpsed or suspect disks lying around
14:11:38 danbri: wonder if there's a ruby client library for sparql protocol...
14:13:59 mortenf: bengee: #semsol logs available as rdf somehow?
14:14:17 mortenf: ... to plug into my irc log watcher :)
14:14:50 bengee: there's a sparql endpoint at http://arc.semsol.org/community/irc/sparql
14:15:22 mortenf: ah, nice
14:15:53 bengee: didn't have the time to create dumps
14:16:11 bengee: even the html logs are horribly ugly
14:17:18 mortenf: this is quite all right, i think
14:17:46 bengee: bengee wonders if hooking into template_redirect is the proper way to add plugin-driven pages to WP
14:17:52 danbri: if literals like in http://danbri.org/2008/xss-rdf/foaf.rdf found their way into the mail or irc endpoints, would the website be compromised?
14:18:57 bengee: ... asks the man with a public DELETE-enabled endpoint ;)
14:19:31 bengee: the IRC log should escape the html
14:20:23 mortenf: mortenf makes mental note to check up on that
14:20:33 bengee: not 100% sure how the triples are stored atm
14:20:46 mortenf: re template_redirect: not sure there really is a "proper" way
14:21:16 mortenf: i've used a filter on the_content for creating a form
14:21:37 bengee: ah
14:22:53 mortenf: i've used template_redirect for other content types though, so probably a good bet
14:23:02 bengee: ok
14:23:33 bengee: it's for the endpoint which doesn't really need other wp theme stuff
14:23:46 danbri: re DELETE; heh, it's all caches of data elsewhere
14:23:51 mortenf: btw, i have some unfinished arc1 work at http://bzr-dev.mfd-consult.dk/sparql-press/
14:24:03 mortenf: right, that i think it's good for
14:24:43 mortenf: feel free to glean and grab etc.
14:24:47 danbri: danbri remembers his list of Stuff To Ask Bengee from yesterday
14:24:54 danbri: how do we get plain text?
14:24:54 danbri: why was the endpoint a white page originally?
14:24:54 danbri: can we get the callback arg thing for json results?
14:24:55 danbri: how do I smush? non destructively?
14:24:55 danbri: how is permissioning of updates for http://sandbox.foaf-project.org/2008/foaf/ggg.php handled?
14:25:07 bengee: oh, cool, mortenf
14:25:16 bengee: that'll be handy for the options page
14:25:48 danbri: oh, i don't remember seeing http://bzr-dev.mfd-consult.dk/sparql-press/
14:26:06 mortenf: note the -dev :)
14:26:24 danbri: did you announce it? did i bang my head and forget it?
14:26:50 mortenf: nope, and hope not
14:27:19 mortenf: i'm quite sure it's not ready for announcement, may break stuff etc...
14:27:40 danbri: have you seen the diso project?
14:27:41 mortenf: also, don't remember where i left it, might not even parse atm
14:27:46 mortenf: yep
14:28:08 danbri: i mentioned sparqlpress (my old wiki notes and recent hacking here) on their list yesterday
14:29:32 mortenf: mortenf looks
14:31:25 bengee: re xss in irc logs: the logger creates turtle from the message which is then sent to the store. a viewer has to make sure that the data is escaped if displayed as html
14:32:17 bengee: the trice module does it, I hope: <b>this is bold</b>
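The point about escaping at display time can be sketched as follows. The function `render_message` and the list-item markup are illustrative assumptions, not the actual log viewer; the idea is that the stored message stays raw and the viewer escapes it just before writing HTML.

```python
from html import escape

def render_message(nick, text):
    """Render one log line as HTML, escaping both nick and message text."""
    return "<li><b>%s</b>: %s</li>" % (escape(nick), escape(text))

# The stored literal is untouched; only the HTML view is escaped.
line = render_message("bengee", "<b>this is bold</b>")
```

With this approach a literal containing markup or script shows up as visible text in the page instead of being interpreted by the browser.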
14:36:32 bengee: danbri, I
14:36:44 bengee: (the plugin is called "rdf-tools")
14:37:02 bengee: e.g. ?rdf-tools=endpoint gives you the endpoint
14:37:51 bengee: other stuff could be "?rdf-tools=xfn2foaf&url=..." etc
14:38:13 bengee: that's all without wp themeing, though
14:38:48 bengee: I'll try to create a config page in wp-admin now
14:40:26 bengee: re your qlist:
14:41:20 bengee: callback: will do that now
14:41:46 bengee: plain: you mean: don't send those damn headers?
14:44:08 danbri: its a little generic; maybe it helps if the url is related to the plugin name?
14:44:17 danbri: but sure, no biggie
14:45:27 bengee: not sure if I can catch a 404, so that I could make that /rdf-tools/endpoint or so
14:45:50 danbri: the <b> in http://arc.semsol.org/community/irc/logs/2008/01/03 is escaped
14:51:05 bengee: "plain" should work for SELECT at least
14:51:23 bengee: it'll print_r the result array
14:51:36 bengee: and sends text/plain
14:52:20 bengee: you can also comment out the sendHeaders() call in your endpoint script
14:52:42 bengee: I do that for debugging
14:56:38 danbri: plain was before we got html tables :)
14:56:51 mortenf: those were the days...
14:57:04 danbri: like yesterday :)
14:57:32 danbri: does arc let me turn off remote update, delete etc?
15:00:09 bengee: "endpoint_features" in the config lets you enable selected query types, but it
15:00:39 bengee: in the knowee prototype I added another thin layer which let you specify a password
15:01:15 bengee: or you could just create two endpoints to the same store, a public one for read-only operations, and a secret or protected one for insert/load/delete
15:04:09 bengee: or just ... , "endpoint_features" => isset($_SERVER) ? array(
15:04:51 bengee: or rather isset($_SERVER) && isset($_SERVER['SERVER_NAME'])
15:05:17 bengee: SERVER_NAME is not available when the script is called from the cl
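The idea of enabling different query types for web requests and command-line runs can be sketched like this. It is written in Python for illustration only: the feature names, the choice of read-only for web requests, and the helpers `endpoint_features` and `is_allowed` are assumptions, since the PHP fragment above is cut off before the feature list.

```python
READ_FEATURES = ["select", "construct", "ask", "describe"]
WRITE_FEATURES = ["load", "insert", "delete"]

def endpoint_features(environ):
    """Pick the enabled query types from a CGI-style environment mapping.

    Assumption: a web request carries SERVER_NAME and gets read-only
    features; a command-line run has no SERVER_NAME and gets everything.
    """
    if environ.get("SERVER_NAME"):
        return list(READ_FEATURES)
    return READ_FEATURES + WRITE_FEATURES

def is_allowed(query, features):
    """Naive check on the first keyword; ignores PREFIX/BASE prologues."""
    words = query.split()
    return bool(words) and words[0].lower() in features

web = endpoint_features({"SERVER_NAME": "example.org"})
cli = endpoint_features({})
```

The two-endpoint alternative mentioned above amounts to the same thing with two fixed feature lists, one per endpoint script.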
15:06:37 danbri: fixed
15:06:46 danbri: i just removed 'em for now
15:06:49 danbri: thanks!
15:10:34 mortenf: bengee, please insert #-anchors in html irc logs next time you update it :)
15:12:05 bengee: will do
15:13:34 danbri: heading out to meet lib n damian...
15:13:37 danbri: take care...
15:15:05 mortenf: see you
15:39:22 bengee: bengee back from phone
15:40:34 bengee: .. and off for a bit
15:49:35 mortenf: mortenf parts...
18:50:31 bengee: bengee stops ranting about drupal's pre-processing overhead after seeing what wordpress is doing internally
18:58:43 bengee: but I found the 404 trigger, so I can offer you a "/rdf-tools/sparql" path for the endpoint now, danbri
19:38:45 bengee: ugh, and no form builder in wp..