Process multiple files in parallel #770

lydell · 2022-08-28T11:33:04Z

Fixes #755. Related to #183.

On my computer, it’s roughly 2.5 times faster! 🎉

command	main	this PR
elm-format --validate .	7.7 s	2.9 s
elm-format --yes . >/dev/null	7.7 s	2.9 s
elm-format --yes .	7.7 s	3.3 s

main refers to commit 1557892.

Tested on a code base of this size:

❯ tokei --type Elm
===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Elm                   745       221612       178984         5131        37497

Computer specs:

macOS Monterey
CPU: 2,6 GHz 6-Core Intel Core i7
RAM: 16 GB 2400 MHz DDR4

Terminal: elm-format --yes . (without >/dev/null) depends on your Terminal. With the default Apple Terminal, I got 3.3 seconds as shown in the table above. With iTerm2, I got 5.5 seconds (and 9.7 seconds for main).

Notes:

This PR seems to work, but I haven’t tested all aspects extensively.
I have only tested on macOS, not Linux or Windows.
I’m a Haskell beginner, so I’ve probably done some weird things. I even know some of them: I generate JSON for the list of validation objects a bit manually, and I turn errors into JSON earlier than feels logical to avoid a “deriving NFData hell” that I couldn’t figure out. I took all shortcuts I could just to win over the compiler :)
It feels like I completely changed how InfoFormatter works. Sorry if that doesn’t fit with your vision for it!
I use @supermario’s hack to avoid garbled printouts: https://github.com/supermario/elm-tooling-compiler/blob/0f63c27e0f334ca680a568034953f9b3d16ba3db/ext-common/Ext/Common.hs#L82-L94
I didn’t spend much time thinking about code style. It’s probably all over the place and not aligned with the rest of the code base!
I use pooled-io which is on version 0.0.2.2 – that’s a very low number! I picked that dependency because it was the only one I understood how to use. It is only used to implement World.mapMConcurrently (a one-liner) though, so it might be easy to replace by someone better at Haskell.

avh4 · 2022-08-29T18:56:03Z

Awesome, thanks!!

Linux: I'll be able to test that
Windows: I'll make sure to have someone check a release candidate build before the next release (or I guess I need to make github CI save the windows artifacts for PRs)
pooled-io 0.0.2.2: low version number doesn't bother me too much, as long as the library is pretty small and self-contained. Also in this case, I did look at pooled-io's source code, and it looks pretty straightforward as a wrapper over more common Haskell concurrency APIs, so this seems totally fine to me
code style: I haven't really settled on a consistent one anyway 😅. Everything I saw so far looks good for code style.
InfoFormatter: I've been trying to get rid of it for a while, so I'm pretty much fine with anything here

Apologies, I'll probably be a little bit slow on merging this since I want to re-read the InfoFormatter changes a couple times, and make sure I personally understand how Haskell's laziness works w/r to the new code. (I trust that doesn't really break anything, both from a quick skim of the code, and from the testing you've done; but I want to put a bit higher of a bar on myself to understand what the runtime behavior actually is.)

Here are a couple things I'll want to take a look at before I merge. Feel free to take a stab at them if you want, or no worries if you don't, I should have a chance to get to them in the next couple weeks.

I'm thinking using the print lock, it should be possible now to have the forked jobs just print their output, and then only have to pass back a Bool to the parent, and that maybe also would restore the streaming output of the json
I'd like to get rid of the unsafePerformIO hack, which I think I'd do by making a new monad stack for the "real" World instance, which would be something like type RealWorld = ReaderT (MVar ()) IO, make a MonadIO RealWorld instance (maybe? I think this might make it less verbose to refactor the World instance? -- edit: oh, maybe ReaderT r IO already has a MonadIO instance?), and then refactor instance World IO to be instance World RealWorld (and the top-level entrypoint then will create the printLock and use runReaderT printLock)
(probably won't look at now for this PR, but might someday in the future) rewrite TransformFiles to use conduit, and then use conduit-concurrent-map instead of pooled-io.

avh4 · 2023-02-10T07:23:54Z

Testing version for MacOS arm64 is available here: https://github.com/avh4/elm-format/actions/runs/4141690402

avh4 · 2023-02-10T22:15:03Z

Just an additional benchmark on a Mac M1 using --validate on ~150k LOC:

(using https://github.com/sharkdp/hyperfine hyperfine --warmup 3 --shell=none)

Version	Mean [s]	Min [s]	Max [s]	Relative
darwin-arm64
this PR	0.824 ± 0.006	0.815	0.832	1.00
`main`	2.776 ± 0.015	2.758	2.805	3.37 ± 0.03
darwin-x86 via Rosetta
this PR	1.058 ± 0.012	1.042	1.085	1.28 ± 0.02
`main`	9.603 ± 0.096	9.475	9.770	11.65 ± 0.14

avh4 · 2023-02-11T03:55:07Z

I'm thinking using the print lock, it should be possible now to have the forked jobs just print their output, and then only have to pass back a Bool to the parent, and that maybe also would restore the streaming output of the json

I took a brief look at doing this now, but it doesn't seem worth the time atm. Doing it in a clean way seems like it would require a way of coordinating an additional thread, which would fit better with the possible future work of using conduit.

Merge #770

avh4 · 2023-02-11T04:57:13Z

Split off #788 for possible future refactoring.

avh4 · 2023-02-11T04:57:33Z

Merged via #789

Thank you!!

lydell added 10 commits August 21, 2022 12:34

WIP World.mapMConcurrently with Control.Concurrent.PooledIO.Final

29f622b

Messy way to finally be able to try it out

18aa375

WIP print JSON

3f1545a

Avoid NFData hell by being more stringly typed

8571fca

Uncomment some tests and make them pass

b6c673f

Uncomment some more tests and make them pass

0fc34a4

Get rid of StateT

cb433ce

Overwrite files concurrently

30a4cd9

Fix grabled printouts

afcf2c9

Default to automatic number of processors

4449df2

avh4 added this to the 0.8.6 milestone Aug 29, 2022

Remove unsafePerformIO hack from World.IO

a19b2b6

avh4 added the performance label Sep 13, 2022

avh4 self-assigned this Jan 26, 2023

Merge branch 'main' into concurrent

5a4571c

This was referenced Feb 11, 2023

Use stm-conduit for logging #788

Open

Merge #770 #789

Merged

avh4 added a commit that referenced this pull request Feb 11, 2023

Merge pull request #789 from avh4/snapshot/concurrent

63a81c4

Merge #770

avh4 merged commit 868eb08 into avh4:main Feb 11, 2023

lydell deleted the concurrent branch February 11, 2023 09:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Process multiple files in parallel #770

Process multiple files in parallel #770

lydell commented Aug 28, 2022 •

edited

Loading

avh4 commented Aug 29, 2022 •

edited

Loading

avh4 commented Feb 10, 2023

avh4 commented Feb 10, 2023

avh4 commented Feb 11, 2023

avh4 commented Feb 11, 2023

avh4 commented Feb 11, 2023

Process multiple files in parallel #770

Process multiple files in parallel #770

Conversation

lydell commented Aug 28, 2022 • edited Loading

avh4 commented Aug 29, 2022 • edited Loading

avh4 commented Feb 10, 2023

avh4 commented Feb 10, 2023

avh4 commented Feb 11, 2023

avh4 commented Feb 11, 2023

avh4 commented Feb 11, 2023

lydell commented Aug 28, 2022 •

edited

Loading

avh4 commented Aug 29, 2022 •

edited

Loading