perf: adds AVX512 implementations of vector.Sum, vector.InnerProduct + assembly refactor by gbotrel · Pull Request #547 · Consensys/gnark-crypto

gbotrel · 2024-09-28T15:22:44Z

Description

In this PR:

factorization of existing assembly code (see refactor: move common assembly routine in subfolder #545 )
adds vector.Sum, vector.InnerProduct , vector.ScalarMul and vector.Mul partially derived from Dag Arne Osvik's work in github.com/a16z/vectorized-fields
tests

Benchmarks

Benchmarks on size from 16 to 16M values.

Seems results are better on AMD EPYC 9R14 (hpc7a, c7a, r7a) than on the intel xeon 8488c (r7i, ...).

hpc7a

r7i

c7a

* refactor: move common assembly routines in root * build: make linter happier * style: cosmetics * test: start fixing integration test * style: factorize mul documentation * feat: add .ASMVector and fix integartion test * test: fix 32bit test * test: fix previous commit

ivokub

Without trying to understand the assembly definitions, looks good to me. There are a few comments, but rather minor.

field/goff/cmd/root.go

internal/generator/tower/asm/amd64/e2.go

internal/generator/tower/asm/amd64/e2_bn254.go

field/generator/generator.go

field/generator/asm/amd64/asm_macros.go

field/generator/generator_test.go

gbotrel added 26 commits September 21, 2024 14:39

feat: asm Vector sum slower, no avx

d7aa178

checkpoint

6d2d8a8

checkpoint

0bb00a0

checkpoint

ce4ade2

feat: add vec.Sum AVX512

847a6df

test: make odd bound for better test case

4796eb3

build: make linter happy

dfcb110

fix: update bound for vec sum to match parameter choices

f45aeb9

perf: loop 8 by 8, cosmetics

75120a0

style: cosmetics

dc15a6a

test: better sum test

a66f547

test: more test

5b2b11d

doc: add reference for reduction algorithm

b27149d

feat: use latest bavard for avx512 instructions

159a7b7

feat: added purego InnerProduct

61268fe

checkpoint wip

71d68aa

checkpoint

8d21a8e

refactor: checkpoint

8a906c6

test: better tests for vec ops

a2e333c

checkpoint

78184a8

test: add more tests for vector ops

0a939b1

feat: update bavard and use better syntax in asm

85e509d

test: make benchmarks on varying sizes

5c202ac

test: bench on larger vector

97cdc21

test: bench on larger vector

48cca4c

gbotrel requested review from ThomasPiellard, ivokub and yelhousni and removed request for ThomasPiellard September 28, 2024 16:24

gbotrel requested a review from Tabaie September 28, 2024 16:24

gbotrel added 23 commits September 29, 2024 20:57

checkpoint

370e77b

checkpoint

452fe92

checkpoint

8ce8ecb

checkpoint

e104c20

checkpoint

e7a1eb0

checkpoint

b868789

checkpoint

8862854

checkpoint

5d212b6

checkpoint

6f063e8

checkpoint

8978198

checkpoint

89c5035

checkpoint

7336398

checkpoint

c56802b

checkpoint

d8be4fe

checkpoint

d7436cd

refactor: use defines for mul

f5065fd

feat: make use of defines in assembly

2b201da

checkpoint

76b236e

style: code cleaning

ccdec18

perf: prefetches in vec ops

24c1aa6

perf: minor adjustements

3b926b1

style: costmetics

d1601ba

feat: handle case where len(vec)==0

138484c

gbotrel requested a review from AlexandreBelling October 3, 2024 20:27

ivokub approved these changes Oct 4, 2024

View reviewed changes

fix: address PR review comments

a560ba4

gbotrel merged commit e26bbdf into master Oct 7, 2024

gbotrel deleted the avx512/innerproduct branch October 7, 2024 14:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: adds AVX512 implementations of vector.Sum, vector.InnerProduct + assembly refactor#547

perf: adds AVX512 implementations of vector.Sum, vector.InnerProduct + assembly refactor#547
gbotrel merged 50 commits intomasterfrom
avx512/innerproduct

gbotrel commented Sep 28, 2024 •

edited

Loading

Uh oh!

ivokub left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gbotrel commented Sep 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Benchmarks

Uh oh!

ivokub left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gbotrel commented Sep 28, 2024 •

edited

Loading