[Mulgara-general] Help
Bill OConnor
wtoconnor at gmail.com
Wed Jul 30 13:32:39 PDT 2008
Great idea. It looks to be another malformed URI, this time with Hebrew in
the rdf:ID. What would be a way to handle this? It seems the Open Directory
Project data is a little out of spec with current RDF.
Caused by: org.mulgara.query.QueryException: java.lang.Error: ARP generated a malformed URI
at org.mulgara.content.rdfxml.Parser.toLocalNode(Parser.java:509)
at org.mulgara.content.rdfxml.Parser.statement(Parser.java:370)
at com.hp.hpl.jena.rdf.arp.ARPResource.tripleRes(ARPResource.java:184)
at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:169)
at com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
at com.hp.hpl.jena.rdf.arp.ARPResource.setType(ARPResource.java:141)
at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:954)
at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
at com.hp.hpl.jena.rdf.arp.RDFParser.root(RDFParser.java:1371)
at com.hp.hpl.jena.rdf.arp.RDFParser.embeddedFile(RDFParser.java:1364)
at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:312)
at com.hp.hpl.jena.rdf.arp.ARP.load(ARP.java:291)
at org.mulgara.content.rdfxml.Parser.run(Parser.java:308)
Caused by: org.mulgara.query.QueryException: java.net.URISyntaxException: Illegal character in fragment at index 79: file:/home/wto/Projects/dmoz/fixedcontent.rdf#Top__Adult__World__Hebrew__גלריות תמונות וסרטים
at java.net.URI$Parser.fail(URI.java:2825)
at java.net.URI$Parser.checkChars(URI.java:2998)
at java.net.URI$Parser.parse(URI.java:3044)
at java.net.URI.<init>(URI.java:595)
at org.mulgara.content.rdfxml.Parser.toLocalNode(Parser.java:506)
... 14 more
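
One possible way to handle this, sketched under assumptions: index 79 appears to land on the space after the first Hebrew word, and java.net.URI accepts non-ASCII letters in a fragment but not a raw space. The multi-argument URI constructor quotes illegal characters instead of rejecting them. The class and method names below are hypothetical, the fragment is a shortened stand-in for the one in the trace, and this is not what Mulgara's Parser.toLocalNode does. Note too that an rdf:ID containing a space is not a valid XML name, so fixing the source data in the preprocessing script may be the cleaner route.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class FragmentQuoting {
    /** Builds a URI from a base document URI and a raw fragment, quoting illegal characters. */
    static URI withFragment(String base, String rawFragment) throws URISyntaxException {
        URI b = new URI(base);
        // Unlike new URI(String), the (scheme, ssp, fragment) constructor
        // percent-encodes characters that are illegal in a fragment, such as a space.
        return new URI(b.getScheme(), b.getSchemeSpecificPart(), rawFragment);
    }

    public static void main(String[] args) throws URISyntaxException {
        String base = "file:/home/wto/Projects/dmoz/fixedcontent.rdf";
        // Hypothetical fragment: two Hebrew words separated by a space.
        String fragment = "Top__World__Hebrew__"
                + "\u05D2\u05DC\u05E8\u05D9\u05D5\u05EA \u05EA\u05DE\u05D5\u05E0\u05D5\u05EA";

        try {
            // The single-argument constructor parses strictly and rejects the space.
            new URI(base + "#" + fragment);
        } catch (URISyntaxException e) {
            System.out.println("Rejected: " + e.getReason() + " at index " + e.getIndex());
        }

        URI quoted = withFragment(base, fragment);
        System.out.println(quoted);                 // space quoted, Hebrew letters kept
        System.out.println(quoted.toASCIIString()); // Hebrew letters percent-encoded as UTF-8
    }
}
```

If the store needs plain ASCII identifiers, toASCIIString() gives a fully percent-encoded form; otherwise the quoted URI keeps the Hebrew letters readable.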
On Wed, Jul 30, 2008 at 12:02 PM, Paul Gearon <gearon at ieee.org> wrote:
> On Wed, Jul 30, 2008 at 1:13 PM, Bill OConnor <wtoconnor at gmail.com> wrote:
> > Hello,
> >
> > I finally think that I've overcome most of the problems with the dmoz RDF.
> > I wrote a small script that goes through the file and fixes the RDF errors.
> > It is a rather large file (2GB).
> > I cannot validate the entire thing since the only validation available
> > seems to be a service rather than a more useful program (does anybody
> > know of a validator that is open source?).
> >
> > I validated a small chunk which also seems to load into mulgara ok. When I
> > try to do the entire file I get:
> >
> > INFO [main] (DataInputTx.java:97) - Sending local resource : file:/home/wto/Projects/dmoz/fixedcontent.rdf
> > Caught: org.mulgara.query.QueryException: Unable to load file:/home/wto/Projects/dmoz/fixedcontent.rdf into rmi://localhost/server1#content2
> > at itql.run(itql.groovy:17)
> > at itql.main(itql.groovy)
> >
> > Questions:
> >
> > Is there some type of file size limit on the load?
>
> No, but for something as large as that you'll want to be on a 64 bit
> machine (and running Java in 64 bit mode).
>
> The error isn't telling me a lot. Is there some way to have Groovy
> print some more info? The stack trace and any chained exceptions
> would really help me pin down why it failed for you (i.e. something at
> your end, or at our end).
>
> Paul
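
For reference, a minimal sketch of how the chained exceptions asked about above can be surfaced. It walks Throwable.getCause() and prints one line per link, which is roughly the "Caused by:" summary a full stack trace carries. The class and method names are hypothetical; calling printStackTrace() on the caught exception in the Groovy script's catch block gives the same information with full frames.

```java
public class CauseChain {
    /** Returns one line per exception in the cause chain, outermost first. */
    static String describe(Throwable t) {
        StringBuilder sb = new StringBuilder();
        for (Throwable c = t; c != null; c = c.getCause()) {
            if (c != t) {
                sb.append("\nCaused by: ");
            }
            // Class name plus message, the same shape as the header line of a stack trace.
            sb.append(c.getClass().getName()).append(": ").append(c.getMessage());
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Stand-in exceptions; a real run would catch the exception thrown by the load.
        Exception inner = new IllegalArgumentException("malformed URI");
        Exception outer = new RuntimeException("Unable to load file", inner);
        System.out.println(describe(outer));
    }
}
```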