Channel #semsol: Logs

This is a public chat log generated from the #semsol IRC channel.

23:00:09 chameleon_95: what times is he usually found here
23:00:16 kwijibo: the only developer
23:00:21 chameleon_95: ahuh
23:00:35 kwijibo: though some of us contribute patches and plugins from time to time
23:00:41 kwijibo: usually 9-5 weekdays
23:00:55 chameleon_95: i am guessing USA time
23:00:57 kwijibo: afaik
23:01:03 kwijibo: Europe time sorry
23:01:17 kwijibo: he's in Germany
23:01:29 chameleon_95: ok.. I am based in China.. I will be back tonight..
23:01:35 chameleon_95: Thanks kwijibo
23:01:36 kwijibo: so approx +1 from GMT
23:01:43 kwijibo: np
23:01:54 kwijibo: how big is your dataset out of interest?
23:02:00 chameleon_95: 59M triples
23:02:12 kwijibo: wow
23:02:50 kwijibo: 128M probably doesn't sound like enough
23:03:57 kwijibo: it would be interesting to see what bengee says: the parser streams, so perhaps memory shouldn't need to be an issue
23:04:19 kwijibo: however i have hit memory issues parsing large rdf docs in the past myself
23:04:48 chameleon_95: the parser is well written, the streaming is faultless
23:05:01 chameleon_95: it's just the SQL reads and writes that are killing me
23:05:29 chameleon_95: I need to increase the keys in the buffer for all the tables to reduce the SQL reads
23:07:48 chameleon_95: for every triple insert there are about 6 reads and 4 writes, now multiply that by 59M
23:08:46 chameleon_95: kwijibo, do you know how to stream a bz2 file using ARC with LOAD?
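
To put the figures quoted above in perspective, here is a rough back-of-the-envelope calculation. It takes chameleon_95's approximate numbers (about 6 reads and 4 writes per triple, 59M triples) at face value; the function name `estimate_sql_ops` is made up for this sketch and is not part of ARC.

```python
# Rough estimate only: the per-insert counts are the approximate
# figures quoted in the log, not measured values.
TRIPLES = 59_000_000
READS_PER_INSERT = 6
WRITES_PER_INSERT = 4


def estimate_sql_ops(triples, reads_per_insert, writes_per_insert):
    """Return (reads, writes, total) SQL operations for a bulk load."""
    reads = triples * reads_per_insert
    writes = triples * writes_per_insert
    return reads, writes, reads + writes


reads, writes, total = estimate_sql_ops(
    TRIPLES, READS_PER_INSERT, WRITES_PER_INSERT
)
print(f"reads:  {reads:,}")   # 354,000,000
print(f"writes: {writes:,}")  # 236,000,000
print(f"total:  {total:,}")   # 590,000,000
```

That is on the order of 590 million SQL operations for the full load, which is why the reads and writes dominate rather than the parsing.
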
23:10:44 kwijibo: no sorry :(
23:11:42 chameleon_95: np
23:11:45 chameleon_95: bbl
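
The bz2 question goes unanswered in this log, and nothing here shows whether ARC's LOAD can read a compressed file directly. As a general illustration of the idea only, the following Python sketch decompresses a bz2 file incrementally and hands lines over in small batches, so the whole file never has to sit in memory. It assumes a line-oriented serialization such as N-Triples; the names `iter_bz2_lines`, `batched` and `demo`, and the example.org data, are hypothetical and unrelated to ARC.

```python
import bz2
import os
import tempfile
from itertools import islice


def iter_bz2_lines(path):
    """Yield non-empty lines from a .bz2 file, decompressing as we read."""
    with bz2.open(path, mode="rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield line


def batched(iterable, size):
    """Group an iterable into lists of at most `size` items."""
    iterator = iter(iterable)
    while True:
        chunk = list(islice(iterator, size))
        if not chunk:
            return
        yield chunk


def demo():
    """Write a tiny compressed sample, then stream it back in batches of 2."""
    sample = (
        '<http://example.org/s1> <http://example.org/p> "a" .\n'
        '<http://example.org/s2> <http://example.org/p> "b" .\n'
        '<http://example.org/s3> <http://example.org/p> "c" .\n'
    )
    with tempfile.NamedTemporaryFile(suffix=".nt.bz2", delete=False) as tmp:
        tmp.write(bz2.compress(sample.encode("utf-8")))
        path = tmp.name
    try:
        # Each batch could be passed to a bulk insert instead of counted.
        return [len(chunk) for chunk in batched(iter_bz2_lines(path), 2)]
    finally:
        os.remove(path)


if __name__ == "__main__":
    print(demo())
```

Batching is shown because grouping inserts is a common way to cut per-triple round trips; whether that suits ARC's store is not something this log establishes.
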