[Mulgara-general] Help

Bill OConnor wtoconnor at gmail.com
Wed Jul 30 13:32:39 PDT 2008


Great idea. It looks to be another malformed URI with Hebrew  in the rdf:ID.

What would be a way to handle this? It seems the open directory project is
a little out of spec with the current RDF.

Caused by: org.mulgara.query.QueryException: java.lang.Error: ARP generated
a malformed URI
    at org.mulgara.content.rdfxml.Parser.toLocalNode(Parser.java:509)
    at org.mulgara.content.rdfxml.Parser.statement(Parser.java:370)
    at com.hp.hpl.jena.rdf.arp.ARPResource.tripleRes(ARPResource.java:184)
    at com.hp.hpl.jena.rdf.arp.ARPResource.triple(ARPResource.java:169)
    at
com.hp.hpl.jena.rdf.arp.ARPResource.setPredObject(ARPResource.java:146)
    at com.hp.hpl.jena.rdf.arp.ARPResource.setType(ARPResource.java:141)
    at com.hp.hpl.jena.rdf.arp.RDFParser.typedNode(RDFParser.java:954)
    at com.hp.hpl.jena.rdf.arp.RDFParser.description(RDFParser.java:429)
    at com.hp.hpl.jena.rdf.arp.RDFParser.obj(RDFParser.java:346)
    at com.hp.hpl.jena.rdf.arp.RDFParser.rdf(RDFParser.java:326)
    at com.hp.hpl.jena.rdf.arp.RDFParser.root(RDFParser.java:1371)
    at com.hp.hpl.jena.rdf.arp.RDFParser.embeddedFile(RDFParser.java:1364)
    at com.hp.hpl.jena.rdf.arp.ARPFilter.parse(ARPFilter.java:312)
    at com.hp.hpl.jena.rdf.arp.ARP.load(ARP.java:291)
    at org.mulgara.content.rdfxml.Parser.run(Parser.java:308)
Caused by: org.mulgara.query.QueryException: java.net.URISyntaxException:
Illegal character in fragment at index 79:
file:/home/wto/Projects/dmoz/fixedcontent.rdf#Top__Adult__World__Hebrew__גלריות
תמונות וסרטים
    at java.net.URI$Parser.fail(URI.java:2825)
    at java.net.URI$Parser.checkChars(URI.java:2998)
    at java.net.URI$Parser.parse(URI.java:3044)
    at java.net.URI.<init>(URI.java:595)
    at org.mulgara.content.rdfxml.Parser.toLocalNode(Parser.java:506)
    ... 14 more


On Wed, Jul 30, 2008 at 12:02 PM, Paul Gearon <gearon at ieee.org> wrote:

> On Wed, Jul 30, 2008 at 1:13 PM, Bill OConnor <wtoconnor at gmail.com> wrote:
> > Hello,
> >
> > I finally think that I've overcome most of the problems with the dmoz
> RDF. I
> > wrote a small script that goes through the file and fixes the RDF errors.
> It
> > is a rather large file (2GB).
> > I cannot validate the entire thing since the only validation available
> seems
> > to be a service
> > rather a then a more useful program (does anybody know of a validator
> that
> > is open source?).
> >
> > I validated a small chunk which also seems to load into mulgara ok. When
> I
> > try to do
> > the entire file I get;
> >
> >  INFO [main] (DataInputTx.java:97) - Sending local resource :
> > file:/home/wto/Projects/dmoz/fixedcontent.rdf
> > Caught: org.mulgara.query.QueryException: Unable to load
> > file:/home/wto/Projects/dmoz/fixedcontent.rdf into
> > rmi://localhost/server1#content2
> >     at itql.run(itql.groovy:17)
> >     at itql.main(itql.groovy)
> >
> > Questions:
> >
> > Is there some type of  file size limit on the load?
>
> No, but for something as large as that you'll want to be on a 64 bit
> machine (and running Java in 64 bit mode).
>
> The error isn't telling me a lot. Is there some way to have the Groovy
> print some more info?  The stack trace and any chained exceptions
> would really help me pin down why it failed for you (ie. something at
> your end, or at our end).
>
> Paul
> _______________________________________________
> Mulgara-general mailing list
> Mulgara-general at mulgara.org
> http://mulgara.org/mailman/listinfo/mulgara-general
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mulgara.org/pipermail/mulgara-general/attachments/20080730/a164add6/attachment.html 


More information about the Mulgara-general mailing list