test: unencoded subpath cannot contain . or .. #368

jeremylong · 2024-12-10T11:39:32Z

Add test cases to check for invalid . and .. segments in the subpath.

supersedes package-url#52 Signed-off-by: Jeremy Long <[email protected]>

test-suite-data.json

jkowalleck · 2025-02-13T13:59:33Z

see also: https://github.com/package-url/purl-spec/pull/394/files#r1954552460

matt-phylum · 2025-02-13T14:09:46Z

I'm wondering if this is actually a good idea. I can agree that the subpath should not contain . or .. segments, and I would prefer if implementations didn't need path normalization logic, but I don't think parsers and formatters should validate that the subpath does not contain . or .. segments. The problem is that if you observe a PURL pkg:golang/google.golang.org/genproto@abcdedf#/googleapis/../api/annotations/ it cannot be understood at all if the parser rejects the .. segment, and it cannot be canonicalized or otherwise parsed and then written back out if the formatter rejects the .. segment. It's syntactically valid and at least the type, name, and version have a clear, unambiguous meaning, but because of the .. everything is thrown away.

jkowalleck · 2025-02-13T14:47:09Z

[...] but I don't think parsers and formatters should validate that the subpath does not contain . or .. segments.

why? i mean, this is already part of the spec:

spec: purl-spec/PURL-SPECIFICATION.rst, line 212 in 65d98c4

- must not be any of '..' or '.'

generate: purl-spec/PURL-SPECIFICATION.rst, line 322 in 65d98c4

- Discard empty, '.' and '..' segments

parse: purl-spec/PURL-SPECIFICATION.rst, line 346 in 65d98c4

- Discard any '.' or '..' segment from that split

The problem is that if you observe a PURL pkg:golang/google.golang.org/genproto@abcdedf#/googleapis/../api/annotations/ it cannot be understood at all if the parser rejects the .. segment, and it cannot be canonicalized or otherwise parsed and then written back out if the formatter rejects the .. segment. It's syntactically valid and at least the type, name, and version have a clear, unambiguous meaning, but because of the .. everything is thrown away.

per spec, the .. and . segments are to be discarded, and the rest is to be kept untouched. no attempt at path resolution or anything. foo/../bar becomes foo/bar (not bar).
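The discard rule described above can be sketched as follows. This is a minimal sketch, not code from the spec or from any purl library: the helper name `normalize_subpath` is hypothetical, and percent-encoding is ignored entirely.

```python
def normalize_subpath(subpath: str) -> str:
    """Drop empty, '.' and '..' segments; keep every other segment untouched.

    Hypothetical helper for illustration only. It deliberately does NOT
    resolve paths: a '..' segment is discarded, it does not remove the
    segment before it.
    """
    segments = subpath.strip("/").split("/")
    kept = [seg for seg in segments if seg not in ("", ".", "..")]
    return "/".join(kept)


print(normalize_subpath("foo/../bar"))                       # foo/bar (not bar)
print(normalize_subpath("/googleapis/../api/annotations/"))  # googleapis/api/annotations
print(normalize_subpath("googleapis/./api/annotations"))     # googleapis/api/annotations
```

Because nothing is resolved, the function never needs to know what the segments refer to; it only filters them.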

matt-phylum · 2025-02-13T14:50:00Z

Then these tests are wrong, right? They have "is_invalid": true, and they include the . and .. in the parsed subpaths.

jkowalleck · 2025-02-13T14:59:25Z

test-suite-data.json

+    "version": "abcdedf",
+    "qualifiers": null,
+    "subpath": "googleapis/./api/annotations",
+    "is_invalid": true


let's see how the different test scenarios behave:

passing - parsing the test canonical purl then re-building a purl from these parsed components should return the test canonical purl. there is no way to get this self-fulfilling prophecy failing, anyway.

passing, as the . is expected to be stripped - parsing the test purl should return the components parsed from the test canonical purl

passing, as the . is expected to be stripped - parsing the test purl then re-building a purl from these parsed components should return the test canonical purl

passing - building a purl from the test components should return the test canonical purl

I think it also fails because the parse succeeds without an error, but is_invalid indicates that an error is expected.

revisited the tests. you are right - all tests are passing.
will set is_invalid from true to false

jkowalleck · 2025-02-13T14:59:56Z

test-suite-data.json

+    "version": "abcdedf",
+    "qualifiers": null,
+    "subpath": "googleapis/../api/annotations",
+    "is_invalid": true


let's see how the different test scenarios behave:

passing - parsing the test canonical purl then re-building a purl from these parsed components should return the test canonical purl. there is no way to get this self-fulfilling prophecy failing, anyway.

passed, as the .. is expected to be stripped - parsing the test purl should return the components parsed from the test canonical purl

passed, as the .. is expected to be stripped - parsing the test purl then re-building a purl from these parsed components should return the test canonical purl

passing - building a purl from the test components should return the test canonical purl

matt-phylum · 2025-02-13T15:51:11Z

test-suite-data.json

+    "name": "genproto",
+    "version": "abcdedf",
+    "qualifiers": null,
+    "subpath": "googleapis/../api/annotations",


Should this be with the .. or without? If we're using the value as input to verify that formatting removes the .., then we want .. here, but if we're verifying that parsing removes the .. we wouldn't want it here.

here, but if we're verifying that parsing removes the .. we wouldn't want it here

The same thing probably confused the original author (and me at first review).

that was the reason i looked up the intended test scenarios.

purl-spec/PURL-SPECIFICATION.rst, lines 473 to 485 in 65d98c4

To test ``purl`` parsing and building, a tool can use this test suite and for every listed test object, run these tests:

- parsing the test canonical ``purl`` then re-building a ``purl`` from these parsed components should return the test canonical ``purl``
- parsing the test ``purl`` should return the components parsed from the test canonical ``purl``
- parsing the test ``purl`` then re-building a ``purl`` from these parsed components should return the test canonical ``purl``
- building a ``purl`` from the test components should return the test canonical ``purl``

so there is no case where any result is compared against the test components.
there are only cases where the test components are used as input and are expected to produce a canonicalized PURL.
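The four scenarios can be sketched as a small harness. This is only a sketch under stated assumptions: `parse` and `build` are toy stand-ins (no qualifiers, no percent-encoding), not any library's API; the field names `purl` and `canonical_purl` and the canonical value shown are assumptions for illustration; the remaining fields are the ones from the test object discussed in this thread.

```python
def _clean(subpath: str) -> str:
    # Discard empty, '.' and '..' segments; no path resolution.
    return "/".join(s for s in subpath.split("/") if s not in ("", ".", ".."))


def parse(purl: str) -> dict:
    """Toy parser for pkg:type/namespace/name@version#subpath only."""
    rest, _, subpath = purl.partition("#")
    scheme, _, rest = rest.partition(":")
    if scheme != "pkg":
        raise ValueError("not a purl")
    rest, _, version = rest.partition("@")
    type_, _, path = rest.strip("/").partition("/")
    namespace, _, name = path.rpartition("/")
    return {
        "type": type_,
        "namespace": namespace or None,
        "name": name,
        "version": version or None,
        "subpath": _clean(subpath) or None,
    }


def build(c: dict) -> str:
    """Toy builder, the inverse of parse for the same limited shape."""
    purl = "pkg:" + c["type"] + "/"
    if c.get("namespace"):
        purl += c["namespace"] + "/"
    purl += c["name"]
    if c.get("version"):
        purl += "@" + c["version"]
    subpath = _clean(c.get("subpath") or "")
    return purl + ("#" + subpath if subpath else "")


def check(test: dict) -> None:
    canonical = test["canonical_purl"]
    keys = ("type", "namespace", "name", "version", "subpath")
    components = {k: test.get(k) for k in keys}
    assert build(parse(canonical)) == canonical           # scenario 1
    assert parse(test["purl"]) == parse(canonical)        # scenario 2
    assert build(parse(test["purl"])) == canonical        # scenario 3
    assert build(components) == canonical                 # scenario 4


test = {
    "purl": "pkg:golang/google.golang.org/genproto@abcdedf#/googleapis/../api/annotations/",
    # Assumed canonical form for this illustration.
    "canonical_purl": "pkg:golang/google.golang.org/genproto@abcdedf#googleapis/api/annotations",
    "type": "golang",
    "namespace": "google.golang.org",
    "name": "genproto",
    "version": "abcdedf",
    "qualifiers": None,
    "subpath": "googleapis/../api/annotations",
    "is_invalid": False,
}
check(test)
print("all four scenarios pass")
```

Note that the test components only ever feed `build` (scenario 4); no parse result is compared against them, which is why a `..` in the component `subpath` exercises the builder rather than the parser.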

test: unencoded subpath cannot contain . or ..

7c5ebfc

supersedes package-url#52 Signed-off-by: Jeremy Long <[email protected]>

jeremylong mentioned this pull request Dec 10, 2024

Update Test Suite #52

Closed

johnmhoran added the labels PURL subpath component, Ecma specification (Work on the core specification), and PURL encoding on Dec 11, 2024

johnmhoran mentioned this pull request Jan 21, 2025

Clarify spec for subpath #379

Open

1 task

jkowalleck self-requested a review February 13, 2025 13:46

jkowalleck previously approved these changes Feb 13, 2025

jkowalleck added the Test suite label Feb 13, 2025

escape '..'

e228898

jkowalleck dismissed their stale review via e228898 February 13, 2025 13:49

escape '.'

f47c2cb

jkowalleck previously approved these changes Feb 13, 2025

jkowalleck mentioned this pull request Feb 13, 2025

docs(subpath): revisited #394

Merged

3 tasks

jkowalleck reviewed Feb 13, 2025

jkowalleck self-requested a review February 13, 2025 15:10

jkowalleck reviewed Feb 13, 2025

jkowalleck reviewed Feb 13, 2025

Apply suggestions from code review

304eee2

jkowalleck dismissed their stale review via 304eee2 February 13, 2025 15:25

jkowalleck approved these changes Feb 13, 2025

matt-phylum reviewed Feb 13, 2025

matt-phylum approved these changes Feb 13, 2025
