Details

      Description

      Add support for N-Quads and submit a patch to Sesame for this integration. This will let us process the Billion Triple Challenge (BTC) data sets, which are useful for performance tests, as well as the bio2rdf data sets, some of which are quite large.

      The nxparser package is available under a mixture of BSD and Apache licenses from [1]. The BTC data set is available from [2,3]. A small PubMed (aka MEDLINE) N-Quads data set is available from [4].

      Note: The unit tests in the nxparser package do not compile, and I have not resolved the problem. However, Andreas Harth assures me that the package is quite stable. There is also a dependency on junit, which is not bundled. nxparser may also share some dependencies with bigdata, in which case we should attempt to converge the versions of those dependencies.

      [1] http://sw.deri.org/svn/sw/2006/08/nxparser
      [2] http://vmlion25.deri.ie/index.html
      [3] http://vmlion25.deri.ie/btc-2009-small.nq.gz
      [4] http://download.bio2rdf.org/data/pubmed/sample/medline09n0001.xml.zip.nq.gz

        Activity

        bryanthompson added a comment -

        Support for N-Quads has been added in the trunk, based on the nxparser package. The integration currently uses a modified nxparser.jar based on nxparser-1.1. The only change was to add another NxParser constructor that accepts a Reader. The existing constructors all require an InputStream, but bigdata tends to invoke the RDFParserFactory with a Reader.
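
        The shape of that change can be sketched as follows. This is a minimal illustration only, not the nxparser source: the class name QuadLineReader, its next() and split() methods, the UTF-8 decoding, and the naive tokenizer are all assumptions made for the example. It shows a stream-based constructor delegating to a Reader-based one, so that callers holding either kind of input share one parsing path.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for illustration; this is NOT the nxparser NxParser class.
class QuadLineReader {
    private final BufferedReader in;

    // Stream-based constructor: decodes bytes (assumed UTF-8) and delegates.
    QuadLineReader(InputStream is) {
        this(new InputStreamReader(is, StandardCharsets.UTF_8));
    }

    // Reader-based constructor: for callers that already hold character data.
    QuadLineReader(Reader r) {
        this.in = (r instanceof BufferedReader) ? (BufferedReader) r : new BufferedReader(r);
    }

    // Returns the terms of the next statement, or null at end of input.
    // Blank lines and lines starting with '#' are skipped.
    String[] next() {
        try {
            String line;
            while ((line = in.readLine()) != null) {
                line = line.trim();
                if (line.isEmpty() || line.startsWith("#")) {
                    continue;
                }
                return split(line);
            }
            return null;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Naive tokenizer: splits on whitespace outside <...> and "...", and drops
    // the final "." token. It assumes whitespace precedes the terminating '.'
    // and performs no validation of the terms.
    static String[] split(String line) {
        List<String> terms = new ArrayList<>();
        StringBuilder cur = new StringBuilder();
        boolean inIri = false;
        boolean inLiteral = false;
        for (int i = 0; i < line.length(); i++) {
            char c = line.charAt(i);
            if (inLiteral) {
                cur.append(c);
                if (c == '\\' && i + 1 < line.length()) {
                    cur.append(line.charAt(++i)); // keep the escaped character as-is
                } else if (c == '"') {
                    inLiteral = false;
                }
            } else if (inIri) {
                cur.append(c);
                if (c == '>') {
                    inIri = false;
                }
            } else if (Character.isWhitespace(c)) {
                if (cur.length() > 0) {
                    terms.add(cur.toString());
                    cur.setLength(0);
                }
            } else {
                if (c == '<') {
                    inIri = true;
                } else if (c == '"') {
                    inLiteral = true;
                }
                cur.append(c);
            }
        }
        if (cur.length() > 0) {
            terms.add(cur.toString());
        }
        if (!terms.isEmpty() && terms.get(terms.size() - 1).equals(".")) {
            terms.remove(terms.size() - 1);
        }
        return terms.toArray(new String[0]);
    }
}
```

        With both constructors in place, a caller holding a Reader can construct the parser directly, while stream-based callers are unaffected. A real integration would also need to let the caller choose the character encoding rather than assuming UTF-8.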


          People

          • Assignee:
            Unassigned
            Reporter:
            bryanthompson