Ticket #598 (closed defect: fixed)

Opened 2 months ago

Last modified 7 weeks ago

nepomuk_testdata cannot export claudia_sterns data into trig file

Reported by: enricominack Owned by: leosauermann
Priority: normal Milestone: m36-redmeteor
Component: ---UNSPECIFIED--- Version:
Keywords: Cc:

Description

I freshly started NEPOMUK and crawled the documents folder of claudia_stern. Then I started nepomuk_testdata (http://dev.nepomuk.semanticdesktop.org/repos/trunk/java/nepomuk_testdata/) to export the crawled data into a trig file, but it failed for the context

  file:/C:/home/claudia_stern/documents/SAP/q4_2003_final.ppt

with the exception

[Fatal Error] :73:25: An invalid XML character (Unicode: 0xb) was found in the element content of the document.
Exception in thread "main" java.lang.NullPointerException
	at org.openrdf.rio.RDFParseException.<init>(RDFParseException.java:81)
	at org.openrdf.rio.helpers.RDFParserBase.reportFatalError(RDFParserBase.java:532)
	at org.openrdf.rio.helpers.RDFParserBase.reportFatalError(RDFParserBase.java:508)
	at org.openrdf.rio.trix.TriXParser.parse(TriXParser.java:114)
	at org.openrdf.rio.trix.TriXParser.parse(TriXParser.java:76)
	at org.openrdf.http.client.HTTPClient.getRDF(HTTPClient.java:1039)
	at org.openrdf.http.client.HTTPClient.getStatements(HTTPClient.java:532)
	at org.openrdf.repository.http.HTTPRepositoryConnection.exportStatements(HTTPRepositoryConnection.java:155)
	at org.openrdf.repository.http.HTTPRepositoryConnection.getStatements(HTTPRepositoryConnection.java:141)
	at org.openrdf.rdf2go.RepositoryModelSet.addModel(RepositoryModelSet.java:358)
	at org.semanticdesktop.nepomuk.testdata.StoreCrawledDataFromNepomukToFile.main(StoreCrawledDataFromNepomukToFile.java:80)

Attachments

patch_ticket_598.patch (5.7 kB) - added by leosauermann 2 months ago.

Change History

Changed 2 months ago by enricominack

  • type changed from task to defect

Changed 2 months ago by leosauermann

tried to fix it using XmlSafetyUtils?.wrapXmlSafeRDFContainer(container) but this fails miserably because of fucking classloader problems. tried a lot, failed miserably after 4 hours, can't fix it, sorry.

my state is attached as patch.

Changed 2 months ago by leosauermann

Changed 7 weeks ago by leosauermann

  • status changed from new to closed
  • resolution set to fixed

fixed in changeset:8926 by filtering out this unicode in ApertureDataWrapper. Still, the bad unicode can be added by anyone malicious.

Note: See TracTickets for help on using tickets.