Array slices #22

glyn · 2020-10-08T16:04:00Z

No description provided.

gregsdennis · 2020-10-08T19:18:54Z

I think we need to add an "optional" property to the tests (or break them out to another section). The "big num" cases can't be supported by all frameworks, and we shouldn't expect it of them. There will be other cars like this as well.

I know we have a "skip" setting, but that means modifying the CTS after I've downloaded it. That should be considered bad practice. The suite should run as-is.

src/ast.rs

mkmik · 2020-10-09T09:41:29Z

src/ast.rs

+            }; // avoid CPU attack
+            for i in (strt..e).step_by(step as usize) {
+                if i < len {
+                    sl.push(&arr[i]);


The spec says:

A negative index j selects an element of an array of length len if and only if 0 <= j + len < len, in which case it selects the same element as the non-negative index j + len.

then it describes the iterations as:

for (i = start; i < end; i = i + step) { ... }

what should happen if you call this on an input slice of length 5?:

let s = array_slice(j.as_array().unwrap(), -5, 0, 1);

My interpretation is that it would yield

i = -5; i < 0 // true push(arr[i + len]) // arr[-5+5] === arr[0] i = i+1 // -4 i < 0 // true push(arr[i + len]) // arr[-4+5] === arr[1] ...

This is a good example of a reference implementation and compliance test suite turning up a hole in the spec.

I think the spec should say (only more formally/precisely) that negative start is syntactic sugar for start + len and similarly for negative end. But then start and end need to be desugared before plugging them into the relevant for loop.

So, for an array of length 5, [-5, 0, 1] corresponds to the elements indexed by the values of i in the for loop:

for (i = 0; i < 0; i = i + 1) { ... }

of which there are none, so the result is empty.

Desugaring before iterating makes most sense to me as well.

I was tinkering with the consequences if doing it the other way around and I don't think it would provide any advantage. An interesting consequence is that it allows the resulting slice to be longer than the input array (and contain repeated elements) which I think would be quite confusing.

I think the implementation is correct and we just need to fix the spec. ;-)

the privileges of a spec writer! :-)

I wonder if something like this would have less special casing: https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=dd7180d6586e5a511b4e56b0058e5562

While I don't think it's fully correct w.r.t overflows etc, but has a few advantages:

no temporary array; just uses rust's own array slice iterator and the ability to reverse any iterator

no risk of "cpu attack", since we're not iterating user provided bounds anyway; worst case of bugs we get a panic accessing a slice out of bounds

less duplication of code; the two branches of step > 0 and step < 0 in the PR contain quite a lot of common code with subtle differences

glyn · 2020-10-12T09:00:02Z

I think we need to add an "optional" property to the tests (or break them out to another section). The "big num" cases can't be supported by all frameworks, and we shouldn't expect it of them. There will be other cars like this as well.

The spec is currently silent about supported precisions of array indices. Probably needs firming up in some way.

It's interesting because for any given language or machine architecture, there will exist large values which exceed the built-in capabilities of the language or architecture.

I know we have a "skip" setting, but that means modifying the CTS after I've downloaded it. That should be considered bad practice. The suite should run as-is.

Depending on how the spec pans out, one option would be to add some kind of "category" field to the tests so that those which may be optional in some sense can be skipped en masse. I'm reticent to do that prematurely. Why not clamp the values to a suitable range (e.g. signed 31 bit) for now? Or is that too inconvenient?

gregsdennis · 2020-10-12T18:56:36Z

I'm happy for this to merge for now. We can come back to it when the spec addresses it. Let's discuss in your new issue.

glyn · 2020-10-15T09:37:16Z

I'm happy for this to merge for now. We can come back to it when the spec addresses it. Let's discuss in your new issue.

I agree the code is good, but I'd prefer to maintain consistency with the spec. I'll merge this PR once the spec is updated.

src/parser.rs

Co-authored-by: Marko Mikulicic <[email protected]>

glyn · 2020-10-20T09:16:53Z

Blocked on ietf-wg-jsonpath/draft-ietf-jsonpath-base#31

2^257=231584178474632390847141970017375815706539969331281128078915168015826259279872 will overflow signed 256 bit integers. This can be increased if implementations surface which can cope with such values.

glyn · 2020-10-28T15:13:32Z

@gregsdennis I'd appreciate your review of the final commit which tests the behaviour when integer representations of array indices and slice parameters overflow. This aims to enforce the current spec.

gregsdennis

Shouldn't the implementation at least attempt to parse the invalid selectors? It looks like cts.rs is just skipping ones that are marked as invalid.

glyn · 2020-10-29T02:51:40Z

@gregsdennis wrote:

Shouldn't the implementation at least attempt to parse the invalid selectors? It looks like cts.rs is just skipping ones that are marked as invalid.

I'm not sure what gives that impression. The parse function of the implementation is executed in the code below from cts.rs regardless of whether the testcase says the selector is invalid:

                let path = jsonpath::parse(&t.selector);

                if let Ok(ref p) = path {

The if statement then tests that parsing failed if and only if the testcase says the selector is invalid.

gregsdennis · 2020-10-30T04:07:38Z

Shows how much I know about rust.

glyn · 2020-11-04T15:22:58Z

I think this is ready to merge now the spec is updated. @mkmik or @gregsdennis: please approve.

glyn added 3 commits October 8, 2020 16:33

Array slicing

3ee433e

Prefer isize to i64

82a630f

Remove unnecessary parameter

4099d13

glyn marked this pull request as draft October 8, 2020 16:11

glyn requested a review from mkmik October 8, 2020 16:11

mkmik reviewed Oct 9, 2020

View reviewed changes

src/ast.rs Outdated Show resolved Hide resolved

mkmik reviewed Oct 9, 2020

View reviewed changes

Address review comments

81d4d2c

glyn mentioned this pull request Oct 12, 2020

Define behaviour for very large array indices and slice components ietf-wg-jsonpath/draft-ietf-jsonpath-base#29

Closed

rebase on slyce

df0a708

glyn self-assigned this Oct 15, 2020

mkmik reviewed Oct 15, 2020

View reviewed changes

src/parser.rs Outdated Show resolved Hide resolved

mkmik mentioned this pull request Oct 16, 2020

Change array slicing specification ietf-wg-jsonpath/draft-ietf-jsonpath-base#31

Merged

Update src/parser.rs

43f7113

Co-authored-by: Marko Mikulicic <[email protected]>

glyn added 2 commits October 28, 2020 10:49

Bump slyce

e0022a6

Test unrepresentable array indices and slice parameters

bfe83b7

2^257=231584178474632390847141970017375815706539969331281128078915168015826259279872 will overflow signed 256 bit integers. This can be increased if implementations surface which can cope with such values.

glyn requested a review from gregsdennis October 28, 2020 15:12

gregsdennis reviewed Oct 28, 2020

View reviewed changes

glyn marked this pull request as ready for review November 4, 2020 15:21

glyn requested review from mkmik and gregsdennis November 4, 2020 15:22

mkmik approved these changes Nov 4, 2020

View reviewed changes

glyn merged commit 85a9fc3 into jsonpath-standard:main Nov 4, 2020

glyn deleted the array-slices branch November 4, 2020 15:50

Array slices #22

Array slices #22

Uh oh!

Conversation

glyn commented Oct 8, 2020

Uh oh!

gregsdennis commented Oct 8, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

mkmik Oct 9, 2020

Choose a reason for hiding this comment

Uh oh!

glyn Oct 12, 2020

Choose a reason for hiding this comment

Uh oh!

mkmik Oct 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

glyn Oct 12, 2020

Choose a reason for hiding this comment

Uh oh!

mkmik Oct 12, 2020

Choose a reason for hiding this comment

Uh oh!

mkmik Oct 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

glyn commented Oct 12, 2020

Uh oh!

gregsdennis commented Oct 12, 2020

Uh oh!

glyn commented Oct 15, 2020

Uh oh!

Uh oh!

glyn commented Oct 20, 2020

Uh oh!

glyn commented Oct 28, 2020

Uh oh!

gregsdennis left a comment

Choose a reason for hiding this comment

Uh oh!

glyn commented Oct 29, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gregsdennis commented Oct 30, 2020

Uh oh!

glyn commented Nov 4, 2020

Uh oh!

Uh oh!

gregsdennis commented Oct 8, 2020 •

edited

Loading

mkmik Oct 12, 2020 •

edited

Loading

mkmik Oct 12, 2020 •

edited

Loading

glyn commented Oct 29, 2020 •

edited

Loading