Check if file is ignored according to git #98

Canop · 2022-12-29T08:36:17Z

Doesn't launch the job if the file is ignored

There's a configuration parameter at the job level to disable this filtering.
Fix #32

doesn't launch the job if the file is ignored (this should be guarded by a configuration parameter)
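
For illustration only, the behaviour described above can be sketched as below. The names `JobSettings`, `apply_gitignore` and `should_launch` are assumptions made for this sketch and are not taken from the PR; the `is_ignored` closure stands in for the real git-based check.

```rust
use std::path::Path;

/// Hypothetical per-job settings; the field name is an assumption.
struct JobSettings {
    /// When false, gitignore filtering is disabled for this job.
    apply_gitignore: bool,
}

/// Decide whether a change to `changed` should launch the job.
/// `is_ignored` stands in for the git-based exclusion check.
fn should_launch(
    job: &JobSettings,
    changed: &Path,
    is_ignored: impl Fn(&Path) -> bool,
) -> bool {
    if !job.apply_gitignore {
        // Filtering disabled: every change launches the job.
        return true;
    }
    !is_ignored(changed)
}
```

With filtering enabled, a change to an ignored path does not launch the job; with it disabled, the exclusion check is never consulted.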

Canop · 2022-12-29T09:32:37Z

To test this and have more insight about the exclusion process, launch it with info level log:

BACON_LOG=info bacon

Byron · 2022-12-29T20:27:43Z

src/ignorer.rs

+        let index = worktree.index()?;
+
+        // there doesn't seem to be any public API for looking at "excludes" without caching
+        // so we create a cache


Cache is probably a misnomer here, as it doesn't cache anything: it merely builds the state needed to perform these lookups, and to be fast when the input is non-random and ordered according to the git index.
I think I ran out of words when naming it, as it contains an ignore stack internally and adds even more information on top to bring everything together.
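
To make the point about ordered input concrete, here is a minimal sketch of an ignore stack. It is not gitoxide's implementation: `IgnoreFiles`, `Level`, `IgnoreStack` and `demo_ignore_stack` are hypothetical names, the ignore files are an in-memory map instead of files on disk, and a rule is just a plain file or directory name (no globs, no negation).

```rust
use std::collections::HashMap;
use std::path::{Path, PathBuf};

/// Stand-in for parsed ignore files: directory (relative to the root) -> ignored names.
type IgnoreFiles = HashMap<PathBuf, Vec<String>>;

/// One stack level: a directory and the names its ignore file excludes.
struct Level {
    dir: PathBuf,
    names: Vec<String>,
}

/// Holds the rules of the directories leading to the last looked-up path.
struct IgnoreStack {
    files: IgnoreFiles,
    levels: Vec<Level>,
    /// How many times the rules of a directory were loaded.
    loads: usize,
}

impl IgnoreStack {
    fn new(files: IgnoreFiles) -> Self {
        Self { files, levels: Vec::new(), loads: 0 }
    }

    /// Tell whether `path` (relative to the root) is excluded.
    fn excludes(&mut self, path: &Path) -> bool {
        let parent = path.parent().unwrap_or(Path::new(""));
        // Pop the levels that are not ancestors of the new path.
        while self.levels.last().map_or(false, |top| !parent.starts_with(&top.dir)) {
            self.levels.pop();
        }
        // Push the missing levels, root first.
        let mut chain: Vec<&Path> = parent.ancestors().collect();
        chain.reverse();
        for dir in chain.into_iter().skip(self.levels.len()) {
            let names = self.files.get(dir).cloned().unwrap_or_default();
            self.loads += 1;
            self.levels.push(Level { dir: dir.to_path_buf(), names });
        }
        // Excluded if a component below a level's directory is listed at that level.
        self.levels.iter().any(|level| {
            path.strip_prefix(&level.dir).map_or(false, |rest| {
                rest.components()
                    .any(|c| level.names.iter().any(|n| c.as_os_str().to_str() == Some(n.as_str())))
            })
        })
    }
}

/// Look up paths in sorted, index-like order; return the results and the load count.
fn demo_ignore_stack() -> (Vec<bool>, usize) {
    let mut files = IgnoreFiles::new();
    files.insert(PathBuf::from(""), vec!["target".to_string()]);
    files.insert(PathBuf::from("src"), vec!["generated.rs".to_string()]);
    let mut stack = IgnoreStack::new(files);
    let results = ["src/generated.rs", "src/main.rs", "target/debug/app"]
        .iter()
        .map(|p| stack.excludes(Path::new(p)))
        .collect();
    (results, stack.loads)
}
```

In this sketch the second lookup reuses both levels pushed by the first one, and the third only replaces the levels below the root. That reuse is what ordered input buys, and it is lost if the whole structure is rebuilt for every path.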

Byron · 2022-12-29T20:47:02Z

src/ignorer.rs

+
+    /// Tell whether the given path is excluded according to
+    /// either the global gitignore rules or the ones of the repository
+    pub fn excludes(&mut self, file_path: &Path) -> Result<bool> {


I think that, as it stands, each path will trigger reading all gitignore files. They are indeed held in the excludes() data structure, and ideally it would be kept around. I see how this isn't possible right now, and believe that the current reference is likely a premature optimization rather than a necessity. This will change for sure, and it's already done in the latest main, which should greatly improve performance as the cache can actually reuse state if it's kept around.

Canop · 2022-12-29

"the current reference is likely a premature optimization rather than a necessity"

I feel you. I fell into the same trap in my first libs too, and it's a pain to fix.

TBH checking whether a file is excluded takes less than 1 ms right now and that's fine for bacon.

Byron · 2022-12-30

I will think about this. The number one usage of lifetimes in structs is platforms: they add some data around an operation and perform it, keeping a reference to their originating Repository. As long as these are basically free, I think they can be created on demand, with the Repository being cloned to wherever it is needed. That's the intent.

If they aren't free though, like the Cache here, I think it's better to clone the Repository into them to make them standalone, or do whatever else it takes. I think a useful rule emerges from this experience, and I will put it into words in DEVELOPMENT.md to make it official.
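
A minimal sketch of the distinction, under stated assumptions: `Repo`, `RulesPlatform`, `ExcludeCache` and `Watcher` are hypothetical types invented for this example and are not gitoxide's API. The cheap platform borrows the repository and is created on demand, while the costly helper owns a cheap clone of it and therefore needs no lifetime parameter.

```rust
use std::sync::Arc;

/// Hypothetical repository data shared between handles.
struct RepoData {
    ignored_names: Vec<String>,
}

/// Hypothetical repository handle: cloning only bumps a reference count.
#[derive(Clone)]
struct Repo {
    data: Arc<RepoData>,
}

impl Repo {
    fn open(ignored_names: &[&str]) -> Self {
        let ignored_names = ignored_names.iter().map(|s| s.to_string()).collect();
        Repo { data: Arc::new(RepoData { ignored_names }) }
    }

    /// A cheap platform, created on demand and borrowing the repository.
    fn rules(&self) -> RulesPlatform<'_> {
        RulesPlatform { repo: self }
    }
}

struct RulesPlatform<'repo> {
    repo: &'repo Repo,
}

impl RulesPlatform<'_> {
    fn count(&self) -> usize {
        self.repo.data.ignored_names.len()
    }
}

/// A costly helper made standalone: it owns a clone of the repository,
/// so it needs no lifetime parameter and can be stored anywhere.
struct ExcludeCache {
    repo: Repo,
    lookups: usize,
}

impl ExcludeCache {
    fn new(repo: &Repo) -> Self {
        ExcludeCache { repo: repo.clone(), lookups: 0 }
    }

    fn excludes(&mut self, name: &str) -> bool {
        self.lookups += 1;
        self.repo.data.ignored_names.iter().any(|n| n == name)
    }
}

/// A long-lived struct that keeps the cache between lookups.
struct Watcher {
    cache: ExcludeCache,
}

fn demo_standalone_cache() -> (usize, bool, bool, usize) {
    let (rule_count, mut watcher) = {
        let repo = Repo::open(&["target", "notes.txt"]);
        let rule_count = repo.rules().count();
        (rule_count, Watcher { cache: ExcludeCache::new(&repo) })
        // `repo` is dropped here; the cache keeps its own clone alive.
    };
    let a = watcher.cache.excludes("target");
    let b = watcher.cache.excludes("main.rs");
    (rule_count, a, b, watcher.cache.lookups)
}
```

Had `ExcludeCache` held a `&'repo Repo` instead, `Watcher` could not have outlived the local `repo`, which is the kind of constraint that forces rebuilding the state for every lookup.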

Canop · 2022-12-30

BTW @Byron I don't know if you may find this interesting, but I've also implemented a gitignoring stack (stacking the parsed gitignore files as I imagine you do): https://github.com/Canop/broot/blob/main/src/git/ignore.rs#L155

This was done for broot with very specific performance concerns (breadth-first tree diving).
The reasons I didn't take that for bacon were:

I didn't want to implement the lookup of the global gitignore rules myself (I use git2 in broot for that)

I wanted to try gitoxide for other programs of mine (and maybe replace git2 with gitoxide in my current programs)

Byron · 2022-12-30

Thanks for sharing! I love the perceived simplicity of the git-ignore implementation, it fits in 240 lines after all :)!

With the Cache type unchained from the lifetime in main, you would now be able to reuse it for each lookup, and that should yield much better performance.

From a correctness point of view, it's probably (hopefully) a good idea to use gitoxide even if the performance is just similar, as I tried my best to validate the implementation against git with many, many test cases. Of course I hope you won't take my word for it and will validate it yourself; gitoxide strives to yield the same results as git.

Thus, I hope you will end up using gitoxide in more of your projects and if there is anything preventing that, I'd love to know to get a chance to fix it.

Canop · 2022-12-30

gitignoring is only part of using git, and I definitely don't want to expand into building a general git crate while there's already an ambitious project, so it's my clear intention to try and use gitoxide ;)

Canop added 2 commits December 28, 2022 20:14

update dependencies

e11b4c3

check if modified file is ignored according to git

d81e8eb

doesn't launch the job if the file is ignored (this should be guarded by a configuration parameter)

Canop mentioned this pull request Dec 29, 2022

Ignore some files #32

Closed

gitignore filtering can now be disabled

9cc6119

in the job configuration

Canop marked this pull request as ready for review December 29, 2022 13:56

Byron reviewed Dec 29, 2022

lower log level of gitignore stuff

32f2485

Byron added a commit to GitoxideLabs/gitoxide that referenced this pull request Dec 30, 2022

Set a stance regarding lifetimes usage.

efe0c13

Related to #675 and Canop/bacon#98.

Canop merged commit 42cc010 into main Dec 30, 2022

Canop deleted the ignore branch December 30, 2022 19:20
