WIP: Integrate ISA-L & Generalised Erasure Coding. by BlamKiwi · Pull Request #80 · koverstreet/bcachefs

BlamKiwi · 2019-12-01T04:36:29Z

Over the weekend I got ISA-L building and integrated CRC64 (5-15x speed-up Ryzen 2200G) as a quick proof point. I just want some quick feedback before tackling full Erasure Coding.

KBuild -
I've added ISA-L and EC as some boolean flags to KBuild. I assume you don't want EC support as a separate module?

Makefile -
The ISA-L code builds without modification from Intel's upstream. This has resulted in very verbose KBuild Special Rules due to the NASM dependency and unnecessary CRC implementations. I would be interested in advice for a better approach until I can port ISA-L to GAS and strip out unused code.

Accel.h/c -
This is a temporary integration point for accessing optimised primitives. I intend to move them to the appropriate kernel lib folders once everything is working.

MD-RAID Compatibility -
The website TODO list mentions Andrea Mazzoleni's technique of combining Vandermonde and Cauchy matrices to implement Erasure Coding compatible with MD-RAID. To begin with I won't be implementing this technique. When stuff is stable I will dig into those mathematics a bit.

We weren't checking for errors when trying to delet stripes, which meant ec_stripe_delete_work() would spin trying to delete the same stripe over and over. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

If there is only a single entry at 0, the first time we call xas_next(), we return the entry. Unfortunately, all subsequent times we call xas_next(), we also return the entry at 0 instead of noticing that the xa_index is now greater than zero. This broke find_get_pages_contig(). Fixes: 64d3e9a ("xarray: Step through an XArray") Reported-by: Kent Overstreet <kent.overstreet@gmail.com> Signed-off-by: Matthew Wilcox (Oracle) <willy@infradead.org>

There was a null ptr deref when there wasn't a stripes heap allocated Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Change it to not mark keys that will be overwritten by keys in the journal - this fixes a bug where we pop an assertion in bucket_set_stripe() because of a stale pointer - because the stripe that has the stale pointer has been deleted. This code could be factored out and used elsewhere, at some point. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Actual repair code will come later, but this is a start Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

With reflink, we'll no longer be able to calculate the offset of the data we want into the extent we're reading from from the extent pos and the iter pos - we'll have to pass it in separately. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

for_each_btree_key() calls bch2_trans_get_iter() - we have to reset the transaction state before getting the iterator again, in the retry path Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Where unlink_on_commit is used, on unsuccessfull commit we're likely retrying the whole update and were going to be using the same iterators again. The management of multiple iterators needs to be gone over a fair bit more at some point... Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Prep work for reflink - for reflink, we're going to be using bch2_extent_update() with other updates in the same transaction. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Minor cleanup - prep work for new key types for reflink Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

With reflink, various code now has to handle both KEY_TYPE_extent or KEY_TYPE_reflink_v - so, convert it to be generic across all keys with pointers. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

More prep work for reflink: for extents, we're not looking for an exact mach on pos, rather that the pos is within the range of the key the iterator points to. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bch2_btree_node_iter_prev_filter() tried to be smart about iterating backwards when skipping over whiteouts/discards - but unfortunately, doing so can leave the node iterator in an inconsistent state; the sane solution is to just always iterate backwards one key at a time. But we compact btree nodes when more than a quarter of the keys are whiteouts/discards, so the optimization wasn't buying us that much anyways. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

koverstreet and others added 30 commits November 18, 2019 11:48

bcachefs: kill page_state_cmpxchg

4a54b14

bcachefs: track dirtyness at sector level, not page

14aba43

bcachefs: Don't try to delete stripes when RO

65b3579

We weren't checking for errors when trying to delet stripes, which meant ec_stripe_delete_work() would spin trying to delete the same stripe over and over. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Fix stripe_idx_to_delete()

80d5ace

There was a null ptr deref when there wasn't a stripes heap allocated Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Convert some assertions to fsck errors

6415eed

Actual repair code will come later, but this is a start Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Don't overflow trans with iters from triggers

c75df65

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Print out name of bkey type

5987286

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: add missing bch2_trans_begin() call

179d885

for_each_btree_key() calls bch2_trans_get_iter() - we have to reset the transaction state before getting the iterator again, in the retry path Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Dont't call bch2_trans_begin_updates() in bch2_extent_update()

fee0227

Prep work for reflink - for reflink, we're going to be using bch2_extent_update() with other updates in the same transaction. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Refactor __bch2_cut_front()

72dad76

Minor cleanup - prep work for new key types for reflink Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Refactor various code to not be extent specific

c7aa700

With reflink, various code now has to handle both KEY_TYPE_extent or KEY_TYPE_reflink_v - so, convert it to be generic across all keys with pointers. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Fix bch2_seek_data()

7ade0fb

bcachefs: Change __bch2_writepage() to not write to holes

809926c

bcachefs: Change buffered write path to write to partial pages

9967ab5

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Handle partial pages in seek data/hole

bc8de43

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Count reserved extents as holes

ed3a627

bcachefs: Truncate/fpunch now works on block boundaries, not page

11381cd

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Export correct blocksize to vfs

1543b9c

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: trans_get_key() now works correctly for extents

5107bb5

More prep work for reflink: for extents, we're not looking for an exact mach on pos, rather that the pos is within the range of the key the iterator points to. Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: fix for_each_btree_key()

7e58452

bcachefs: Ensure bch2_trans_get_iter() returns iters with correct locks

4f0a3ff

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Mark space as unallocated on write failure

acc145f

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Rework calling convention for marking overwrites

a931021

bcachefs: Improved debug checks

9877d9d

Signed-off-by: Kent Overstreet <kent.overstreet@gmail.com>

bcachefs: Fix __bch2_btree_iter_peek_slot_extents()

1559231

koverstreet force-pushed the master branch 4 times, most recently from 8f49267 to 0b5e3ee Compare March 22, 2021 22:56

koverstreet force-pushed the master branch 5 times, most recently from ad68801 to 6e8f25f Compare March 25, 2021 03:52

koverstreet force-pushed the master branch 2 times, most recently from 83dd3db to 6a3927a Compare April 5, 2021 02:43

koverstreet force-pushed the master branch 3 times, most recently from b0f77a0 to f2700b9 Compare April 27, 2021 16:30

koverstreet force-pushed the master branch from 769ec49 to a5c0e1b Compare April 30, 2021 20:48

koverstreet force-pushed the master branch 2 times, most recently from da5ffff to 75a3eb8 Compare May 23, 2021 02:44

koverstreet force-pushed the master branch from e3a7cee to e6acff9 Compare June 10, 2021 00:05

koverstreet force-pushed the master branch from ddc5930 to dbee44d Compare July 6, 2021 17:10

koverstreet force-pushed the master branch from 7c71c56 to fc831fc Compare September 8, 2021 18:51

koverstreet force-pushed the master branch 2 times, most recently from 4b2d093 to 45665ce Compare November 4, 2021 16:28

koverstreet force-pushed the master branch from 0c2084d to e29c940 Compare November 24, 2021 00:02

koverstreet force-pushed the master branch 5 times, most recently from 16cbc9a to 8fc58b1 Compare December 21, 2021 23:04

koverstreet force-pushed the master branch 2 times, most recently from fa97ffc to a4c0a23 Compare December 27, 2021 04:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Integrate ISA-L & Generalised Erasure Coding.#80

WIP: Integrate ISA-L & Generalised Erasure Coding.#80
BlamKiwi wants to merge 367 commits intokoverstreet:masterfrom
BlamKiwi:isal

BlamKiwi commented Dec 1, 2019 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

BlamKiwi commented Dec 1, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

BlamKiwi commented Dec 1, 2019 •

edited

Loading