feat(findExports): use acorn tokenizer to filter false positive exports #56

hubvue · 2022-06-29T11:51:28Z

This PR is designed to address this issue #34

example:

// export { foo } from 'foo1';
// exports default 'foo';
// export { useB, _useC as useC };
// export function useA () { return 'a' }
// export { default } from "./other"
// export async function foo () {}
// export { foo as default }
//export * from "./other"
//export * as foo from "./other"

/**
 * export const a = 123
 * export { foo } from 'foo1';
 * exports default 'foo'
 * export function useA () { return 'a' }
 * export { useB, _useC as useC };
 *export { default } from "./other"
 *export async function foo () {}
 * export { foo as default }
 * export * from "./other"
 export * as foo from "./other"
 */

export { bar } from 'foo2';
export { foobar } from 'foo2';

The expected length of matches should be "2", but instead we get "17".

'\b' does not solve such problems, its semantics is the junction of words and non-words. Apparently the following code is also semantically correct.

// (\b is here)export { foo } from 'foo1';

Solutions：filter out comments before matching to avoid them affecting the matching process.

…ch export

src/analyze.ts

pi0 · 2022-06-29T16:33:10Z

We can also use strip-literal by @antfu. However, downside is that it involves full AST parsing and sloweer.

I think this is last tested version based on regex.

@antfu Can you please explain finally what were limitations regex couldn't support? Did you migrate unimport for a specific reason or avoiding possible edge cases in the future?

antfu · 2022-06-30T02:00:58Z

Here are the cases that regexp can't possibly handle correctly (either incompletely removed, or overkill with quotes unmatched.

https://github.com/antfu/strip-literal/blob/4604f3c1d5849af27f6b74635249907472e8df4c/test/index.test.ts#L24-L111

Context:

In addition:

strip-literal only does tokenizing instead of full parsing. It's probably the most efficient way to do this correctly.
While tokenizing requires the input to be valid JavaScript, I also introduced stripLiteralRegex as a fallback (implementation inherited from vite and unimport), you can either call it directly, or use stripLiteral() to do it with auto fallback.

hubvue · 2022-06-30T04:01:48Z

Using strip-literal to filter noise is indeed a good option, but a problem arose during my testing.
example:

export { foobar } from 'foo2';

After strip-literal processing

export { foobar } from '     ';

Expected namedExports:

[{
    type: 'named',
    exports: ' bar ',
    specifier: 'foo2',
    code: "export { bar } from 'foo2';",
    start: 121,
    end: 148,
    names: [ 'bar' ]
}]

Received namedExports:

[{
    type: 'named',
    exports: ' bar ',
    specifier: undefined,
    code: 'export { bar }',
    start: 794,
    end: 808,
    names: [ 'bar' ]
}]

specifier got undefined

hubvue · 2022-06-30T06:48:02Z

I have found that the presence of an export statement in the string also affects the result.

example:

const test1 = "export { ba1 } from 'foo2'"
const test2 = "testexport { bar2 } from 'foo2'"
const test3 = "test export { bar3 } from 'foo2'"
const test4 = "export { bar4 } from 'foo2' test"
const test5 = \`
  test1
  export { bar4 } from 'foo2' test
  test2
`

expected 0 got 4.

regexp matching is too cumbersome and perhaps AST is an option (but only if performance is accepted).

pi0 · 2022-06-30T08:25:15Z

Thanks for information @antfu. I guess performance would be reasonable and we can introduce two versions of find* utils for a regex-only extractor when performance/bundle is matter.

Can we maybe support a filter function from strip-literal? We want to avoid stripping strings that do not include an import for mlly case.

antfu · 2022-07-01T08:33:52Z

Can we maybe support a filter function from strip-literal?

While we could do it, I am not sure if there is a good way to detect whether a string should be kept or striped. Since strip-literal does not change the position of content, I would suggest maybe detecting the exports using the striped code, while reading the content using the original.

…c exports

hubvue · 2022-07-01T09:38:15Z

I would suggest maybe detecting the exports using the striped code, while reading the content using the original.

@antfu This is a great idea, I've been trying to figure out how to parse and filter out the export statements in the strings all morning and still haven't had a good result.

pi0

Nice changes!

src/analyze.ts

hubvue · 2022-07-11T03:44:32Z

@pi0 Is there any progress on this PR?

codecov · 2022-07-22T02:44:48Z

Codecov Report

Merging #56 (0a61617) into main (83502b4) will increase coverage by 0.36%.
The diff coverage is 76.00%.

@@            Coverage Diff             @@
##             main      #56      +/-   ##
==========================================
+ Coverage   52.52%   52.89%   +0.36%     
==========================================
  Files          13       13              
  Lines        2115     2159      +44     
  Branches      171      179       +8     
==========================================
+ Hits         1111     1142      +31     
- Misses        842      847       +5     
- Partials      162      170       +8

Impacted Files	Coverage Δ
src/analyze.ts	`80.68% <76.00%> (-3.41%)`	⬇️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

pi0 · 2022-08-03T09:53:16Z

Thanks for working on this PR @hubvue <3

hubvue · 2022-08-03T10:00:41Z

@pi0 I feel very happy to be able to contribute to an open source project. 😄

hubvue added 4 commits June 29, 2022 19:43

fix: filter out commented code to eliminate the effect of regular mat…

7e7624d

…ch export

test: update test case name

ec5496e

test: delete comment

79af763

fix: update match type

3d365cb

pi0 reviewed Jun 29, 2022

View reviewed changes

src/analyze.ts Outdated Show resolved Hide resolved

pi0 closed this Jun 29, 2022

pi0 reopened this Jun 29, 2022

feat: use replace + reg filter comments

b620f3c

feat: update match comments regexp

95934c6

feat: filtering export statements in strings

bbfc0ce

feat: use the tokenizer to analyse the code and filter out unsyntacti…

33365b0

…c exports

hubvue requested a review from pi0 July 1, 2022 09:38

pi0 reviewed Jul 1, 2022

View reviewed changes

src/analyze.ts Outdated Show resolved Hide resolved

src/analyze.ts Outdated Show resolved Hide resolved

src/analyze.ts Outdated Show resolved Hide resolved

src/analyze.ts Outdated Show resolved Hide resolved

feat: rename & supplementary types

9ab5734

hubvue requested a review from pi0 July 1, 2022 14:48

chore: merge remote main branch

4ceb9ad

pi0 changed the title ~~fix: filter out commented code to eliminate the effect of regular mat…~~ feat(findExports): use acorn tokenizer Aug 3, 2022

pi0 added 4 commits August 3, 2022 11:42

Merge branch 'main' into pr/hubvue/56

a84bed2

update lockfile

a1b5113

add back acorn dep

f1bd672

perf: tokenize if there are export matches

0a61617

pi0 changed the title ~~feat(findExports): use acorn tokenizer~~ feat(findExports): use acorn tokenizer to filter false positive exports Aug 3, 2022

pi0 approved these changes Aug 3, 2022

View reviewed changes

pi0 merged commit 7039f54 into unjs:main Aug 3, 2022

hubvue mentioned this pull request Aug 5, 2022

chore(deps): upgrade mlly to resolve the presence of commented export statements in pkg antfu/pkg-exports#4

Merged

pcbowers mentioned this pull request Feb 27, 2023

findStaticImports and findDynamicImports do not respect comments #153

Closed

antfu mentioned this pull request Mar 1, 2023

fix(analyze): ignore conmments for imports detection #155

Merged

pi0 mentioned this pull request Jan 11, 2024

feat: make stripComments optional for syntax detection #217

Merged

8 tasks

pi0 mentioned this pull request Oct 6, 2024

fix: comment stripping should remove multiline comments (#279) #280

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(findExports): use acorn tokenizer to filter false positive exports #56

feat(findExports): use acorn tokenizer to filter false positive exports #56

hubvue commented Jun 29, 2022 •

edited

Loading

pi0 commented Jun 29, 2022

antfu commented Jun 30, 2022 •

edited

Loading

hubvue commented Jun 30, 2022

hubvue commented Jun 30, 2022

pi0 commented Jun 30, 2022

antfu commented Jul 1, 2022

hubvue commented Jul 1, 2022

pi0 left a comment

hubvue commented Jul 11, 2022

codecov bot commented Jul 22, 2022 •

edited

Loading

pi0 commented Aug 3, 2022

hubvue commented Aug 3, 2022

feat(findExports): use acorn tokenizer to filter false positive exports #56

feat(findExports): use acorn tokenizer to filter false positive exports #56

Conversation

hubvue commented Jun 29, 2022 • edited Loading

pi0 commented Jun 29, 2022

antfu commented Jun 30, 2022 • edited Loading

hubvue commented Jun 30, 2022

hubvue commented Jun 30, 2022

pi0 commented Jun 30, 2022

antfu commented Jul 1, 2022

hubvue commented Jul 1, 2022

pi0 left a comment

Choose a reason for hiding this comment

hubvue commented Jul 11, 2022

codecov bot commented Jul 22, 2022 • edited Loading

Codecov Report

pi0 commented Aug 3, 2022

hubvue commented Aug 3, 2022

hubvue commented Jun 29, 2022 •

edited

Loading

antfu commented Jun 30, 2022 •

edited

Loading

codecov bot commented Jul 22, 2022 •

edited

Loading