Parallel algorithms POC #559

breznak · 2019-07-12T12:56:01Z

proof of concept of running c++17 parallel TS

This is for your interest, not intended for merging yet.

try execution::par_unseq for inhibition in SP. Results: takes much longer than seq.

This reverts commit b27b40c.

breznak

Please review if there's an obvious error or if the MNIST_SP.cpp settings may be too artificial.
Otherwise my parallelization effort using c++17 TS Parallel has hit the reality.

The problem is memory bound
execution::par_unseq really puts all threads to a good use
MNIST is still slower than single-thread!
- significantly: 60s (single) vs 240+s (parallel)
- likely broken cache locality
- or even CPU throttling (or boosting the only signle thread running)

breznak · 2019-07-17T11:46:24Z

src/htm/algorithms/SpatialPooler.cpp

@@ -844,19 +845,25 @@ void SpatialPooler::inhibitColumnsGlobal_(const vector<Real> &overlaps,
  // faster than a regular sort because it stops after it partitions the
  // elements about the Nth element, with all elements on their correct side of
  // the Nth element.
-  std::nth_element(
+  tNth.start();
+  std::nth_element(htm::parallel::mode,


inhibition identified as one of the slowest methods in SP.
nth_element as the most significant in global inh.

breznak · 2020-08-06T21:20:05Z

FYI, https://developer.nvidia.com/blog/accelerating-standard-c-with-gpus-using-stdpar/

breznak added 4 commits July 11, 2019 22:18

Parallel: compile with TBB if g++-9

4834086

Parallel: WIP parallel demo on MNIST

2e0760b

Merge branch 'master_community' into parallel_ts

7bb88c7

MNIST: provide both single, parallel version

b27b40c

breznak added in_progress optimization code code enhancement, optimization, cleanup..programmer stuff labels Jul 12, 2019

breznak self-assigned this Jul 12, 2019

breznak mentioned this pull request Jul 12, 2019

SP compute() const, thread safe #560

Draft

4 tasks

breznak added 5 commits July 13, 2019 13:29

Merge branch 'master_community' into parallel_ts

8f03604

Merge branch 'master_community' into parallel_ts

048d855

add Parallelizable header

a4e0113

SP: global inhibition: parallel

230905c

try execution::par_unseq for inhibition in SP. Results: takes much longer than seq.

Revert "MNIST: provide both single, parallel version"

ab572cd

This reverts commit b27b40c.

breznak commented Jul 17, 2019

View reviewed changes

breznak mentioned this pull request Nov 1, 2019

Provide effective way to query presynaptic cells from Connections #668

Closed

3 tasks

breznak added 2 commits April 10, 2020 18:43

Merge branch 'master_community' into parallel_ts

2a6ddc4

Merge remote-tracking branch 'community/master' into parallel_ts

1918f2d

Merge branch 'master_community' into parallel_ts

985ac1a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel algorithms POC #559

Parallel algorithms POC #559

breznak commented Jul 12, 2019

breznak left a comment

breznak Jul 17, 2019

breznak commented Aug 6, 2020

Parallel algorithms POC #559

Are you sure you want to change the base?

Parallel algorithms POC #559

Conversation

breznak commented Jul 12, 2019

breznak left a comment

Choose a reason for hiding this comment

breznak Jul 17, 2019

Choose a reason for hiding this comment

breznak commented Aug 6, 2020