Channel #semsol: Logs

This is a public chat log generated from the #semsol IRC channel.

14:57:49 mib_twtmws: Hi, I have a question about arc load
15:01:43 bengee: just ask :)
15:03:30 mib_twtmws: thanks
15:03:57 mib_twtmws: the thing is that I want to upload a big rdf file into the mysql-based store
15:04:19 mib_twtmws: but there ir no way I can do it using the sparql+ endpoint
15:04:47 mib_twtmws: so I was wondering if there is another way to do it or how to "translate" rdf tripls to sql inserts for the given schema
15:05:25 bengee: LOAD <path/to/file> doesn't work?
15:06:58 mib_twtmws: it does work BUT there isn't enough memory to load the while graph before loading it
15:07:29 bengee: ARC should stream-load with constant mem
15:07:51 mib_twtmws: I can't do such a big use memory and I don't have control over the php configuration
15:08:14 bengee: do any triples end up in the mysql tables at all?
15:08:29 bengee: you can reduce the insert batch size
15:09:18 bengee: .. by specifying a low "store_write_buffer" in the configuration
15:09:27 bengee: default is 2500
15:09:56 bengee: that's the number of triples that are buffered in memory before they are sent to mysql
15:10:42 mib_twtmws: yes, there are some inserts but not so much
15:10:48 mib_twtmws: I will try again
15:11:05 bengee: the batch config is available from revision 2009-03-04
15:11:56 bengee: I've been able to load 3MT with PHP memory set to 8MB
15:11:58 mib_twtmws: ok, I'll check that
15:12:09 bengee: is the input doc RDF/XML?
15:12:23 mib_twtmws: yes
15:12:26 bengee: ok
15:12:42 mib_twtmws: i was wondering if it would be better to convert it to ntriples
15:13:03 bengee: rdf/xml is safest, stream-wise
15:14:28 bengee: the ntriples/turtle parser sometimes doesn't find delimiters in non-trivial markup and then fills up the memory
15:14:50 mib_twtmws: ok, I see