Uploaded image for project: 'Blazegraph (by SYSTAP)'
  1. Blazegraph (by SYSTAP)
  2. BLZG-747

SPARQL REGEX operator does not perform case-folding correctly for Unicode data

    Details

      Description

      In versions up to 1.2.2, there is a bug in the SPARQL REGEX operator such that it does not perform case-folding correctly for Unicode data. You can work around this by specifying the 'u' flag.

      This issue was described in [1]

      [1] https://sourceforge.net/projects/bigdata/forums/forum/676946/topic/7073971

        Activity

        Hide
        bryanthompson bryanthompson added a comment -

        I have modified the RegexBOp to automatically specify 'u' when 'i' is given (forces Unicode case-folding) and to make specifying 'u' by itself a NOP.

        I have verified that there are no test failures in the SPARQL query test suite related to this change.

        Committed Revision r7018.

        Show
        bryanthompson bryanthompson added a comment - I have modified the RegexBOp to automatically specify 'u' when 'i' is given (forces Unicode case-folding) and to make specifying 'u' by itself a NOP. I have verified that there are no test failures in the SPARQL query test suite related to this change. Committed Revision r7018.
        Hide
        bryanthompson bryanthompson added a comment -

        I have added a unit test based on the example provided in the forum post (using ASCII encoding of Unicode characters). The test fails without the RegexBOp modification and passes with that modification.

        Committed Revision r7019.

        Show
        bryanthompson bryanthompson added a comment - I have added a unit test based on the example provided in the forum post (using ASCII encoding of Unicode characters). The test fails without the RegexBOp modification and passes with that modification. Committed Revision r7019.

          People

          • Assignee:
            bryanthompson bryanthompson
            Reporter:
            bryanthompson bryanthompson
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved: