feat: add transaction support (BEGIN/COMMIT/ROLLBACK) with WAL recovery by samarjeet818 · Pull Request #137 · aviralgarg05/NexumDB

samarjeet818 · 2026-02-13T12:21:23Z

Summary

add transaction SQL support for BEGIN, BEGIN TRANSACTION, COMMIT, and ROLLBACK
implement executor transaction state with transaction ID tracking and single active transaction guard
add WAL-backed snapshot recovery:
- write snapshot on BEGIN
- restore snapshot on ROLLBACK
- recover uncommitted transaction snapshot on startup
wire CLI help and output formatting (human/json) for transaction events
extend tests for parser, executor transaction flow, rollback/commit behavior, and recovery
fix storage batch key removal to use key.as_slice() for sled compatibility

Testing

cargo fmt --all -- --check
cargo +stable-x86_64-pc-windows-gnu clippy --workspace --all-targets -- -D warnings
cargo +stable-x86_64-pc-windows-gnu doc --workspace --no-deps
ruff check nexum_ai
python -m compileall -q nexum_ai
cd nexum_ai && pytest --cov=. --cov-report=xml --cov-report=term-missing (97 passed)

Closes #22

Summary by CodeRabbit

New Features
- Transaction control: BEGIN, COMMIT, and ROLLBACK commands now available for managing multi-step database operations.
- Automatic recovery: Uncommitted transactions are recovered on restart to ensure data consistency.
- Enhanced output: Transaction status messages displayed in CLI results.
Tests
- Added integration tests for transaction workflows including rollback and commit scenarios.

coderabbitai · 2026-02-13T12:21:41Z

Warning

Rate limit exceeded

@samarjeet818 has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 9 minutes and 19 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

Walkthrough

This pull request implements transaction support by adding BEGIN/COMMIT/ROLLBACK commands with write-ahead logging (WAL) for durability. Changes span the parser, planner, executor, storage engine, and CLI layers. Transaction state is tracked via WAL, with recovery logic to restore state after crashes.

Changes

Cohort / File(s)	Summary
SQL Parser & Type System `nexum_core/src/sql/parser.rs`, `nexum_core/src/sql/types.rs`, `nexum_core/src/sql/planner.rs`	Added recognition of BEGIN, BEGIN TRANSACTION, COMMIT, and ROLLBACK tokens. Introduced BeginTransaction, CommitTransaction, and RollbackTransaction as new Statement and Plan variants.
Executor Transaction Logic `nexum_core/src/executor/mod.rs`	Implemented WAL-backed transaction support with TxWalFile struct, TransactionState tracking, and lifecycle methods (begin_transaction, commit_transaction, rollback_transaction). Integrated transaction writes into INSERT, UPDATE, DELETE, and table operations. Added crash recovery logic via startup WAL scanning.
Storage Engine WAL Support `nexum_core/src/storage/engine.rs`	Extended StorageEngine with wal_path field and corresponding utilities: scan_all, delete_keys, and wal_path accessor. Updated initialization to set WAL file path for file-backed engines and None for in-memory engines.
CLI Result Output `nexum_cli/src/main.rs`	Added ExecutionResult variants (TransactionBegan, TransactionCommitted, TransactionRolledBack) with corresponding JSON and formatted output handlers. Extended help text to document transaction control commands.
Integration Tests `tests/integration_test.rs`	Added test_transaction_commands_flow to verify full transaction lifecycle: begin, insert with rollback validation, and insert with commit persistence.

Sequence Diagram

sequenceDiagram
    participant Client
    participant Parser
    participant Executor
    participant WAL as WAL File
    participant Storage as Storage Engine

    Client->>Parser: BEGIN TRANSACTION
    Parser->>Executor: ExecutionResult(BeginTransaction)
    Executor->>Executor: Create TransactionState
    Executor->>WAL: Store initial snapshot
    Executor-->>Client: TransactionBegan { tx_id }

    Client->>Parser: INSERT/UPDATE/DELETE
    Parser->>Executor: Execute operation
    Executor->>Storage: Perform write
    Executor->>WAL: Record transaction write
    Executor-->>Client: OperationResult { writes }

    Client->>Parser: COMMIT
    Parser->>Executor: ExecutionResult(CommitTransaction)
    Executor->>WAL: Mark transaction as committed
    Executor->>Storage: Flush all writes
    Executor-->>Client: TransactionCommitted { tx_id, writes }

    alt Crash Scenario
        Executor->>Executor: Startup recovery
        Executor->>WAL: Read uncommitted tx from WAL
        Executor->>Storage: Restore from snapshot
        Executor->>WAL: Clear WAL entry
    end

    Client->>Parser: ROLLBACK
    Parser->>Executor: ExecutionResult(RollbackTransaction)
    Executor->>Storage: Restore from snapshot
    Executor->>WAL: Clear transaction entry
    Executor-->>Client: TransactionRolledBack { tx_id }

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

🚥 Pre-merge checks | ✅ 5 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 28.95% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title 'feat: add transaction support (BEGIN/COMMIT/ROLLBACK) with WAL recovery' directly and accurately summarizes the main changes, clearly indicating transaction feature additions with WAL recovery support.
Linked Issues check	✅ Passed	The PR implements all coding requirements from issue `#22`: SQL commands (BEGIN/COMMIT/ROLLBACK) parsing and execution, WAL-backed persistence, transaction ID tracking, and crash recovery via WAL. Isolation via locking/MVCC is partially addressed via single-active-transaction guard.
Out of Scope Changes check	✅ Passed	All changes align with issue `#22` objectives. Parser, planner, executor, storage, and CLI modifications directly support transaction feature. Integration tests validate transaction flow. Storage batch key removal fix is a minor compatibility improvement supporting the feature.
Merge Conflict Detection	✅ Passed	✅ No merge conflicts detected when merging into `main`

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Signed-off-by: unknown <010samarjeet@gamil.com>

coderabbitai

Actionable comments posted: 8

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

nexum_core/src/executor/mod.rs (1)

85-91: ⚠️ Potential issue | 🟠 Major

Writes within a transaction are immediately visible to reads — no isolation.

The linked issue #22 mentions "Provide isolation via MVCC or an initial simpler locking mechanism," but the current implementation applies writes directly to storage during a transaction. Any SELECT (even from a hypothetical concurrent session) will see uncommitted data. This is effectively a "dirty read" isolation level.

Consider at minimum documenting this limitation, or buffering writes in the TransactionState and only applying them on COMMIT.

🤖 Fix all issues with AI agents

In `@nexum_core/src/executor/mod.rs`:
- Around line 40-41: The tx_state field currently uses
RefCell<Option<TransactionState>> which makes Executor !Send and !Sync; replace
it with std::sync::Mutex<Option<TransactionState>> (keep tx_counter as
AtomicU64) so Executor becomes thread-compatible; update all uses of
tx_state.borrow()/borrow_mut() to tx_state.lock().unwrap() (or handle
PoisonError) and adjust any pattern matching or option manipulation accordingly
in the Executor implementation and methods that reference
tx_state/TransactionState.
- Around line 508-531: The BEGIN implementation in begin_transaction uses
storage.scan_all() to create a full in-memory and on-disk TxWalFile snapshot
(TxWalFile, write_transaction_wal, tx_state), which is O(n) and won't scale;
update the codebase by adding a clear TODO comment inside begin_transaction
explaining this known limitation, reference scan_all() and TxWalFile usage, and
add an entry to project documentation or CHANGELOG stating that the current WAL
is a full-snapshot MVP and must be replaced with an incremental undo/redo
mutation log; also create or link a tracking issue/placeholder task ID in the
comment pointing to a future refactor to per-mutation WAL so reviewers and
future contributors can find and prioritize the change.
- Around line 570-574: record_transaction_write currently no-ops outside a
transaction and increments tx_state.write_count by one per statement, but you
should change it to track actual row mutations: modify fn
record_transaction_write to accept a rows: usize parameter and add rows to
self.tx_state.borrow_mut().as_mut().write_count (preserving the no-op when
tx_state is None), then update all call sites that currently call
record_transaction_write() to pass the number of rows affected (e.g., where rows
are known in mutation handlers or execution paths), ensuring the symbol names
record_transaction_write, tx_state, and write_count are used so the compiler
guides you to adjust every usage.
- Around line 637-657: restore_snapshot currently deletes missing keys then
rewrites the entire snapshot (using snapshot.to_vec()), which is expensive and
clones the whole slice; change restore_snapshot to compute a diff: call
self.storage.scan_all() and build a HashMap from the incoming snapshot slice for
lookups, then build two small collections — keys_to_delete (keys present in
current but not in snapshot) and entries_to_set (only those (k,v) from snapshot
where either k is missing in current or value differs) — call
self.storage.delete_keys(&keys_to_delete)? and
self.storage.batch_set(entries_to_set)? (collect entries_to_set into a
Vec<(Vec<u8>,Vec<u8>)> by cloning only the changed/new entries, not the entire
snapshot), then self.storage.flush()?; update code around restore_snapshot,
scan_all, delete_keys, batch_set, and flush accordingly to avoid the
unconditional full rewrite and the snapshot.to_vec() clone.
- Around line 533-551: commit_transaction currently clones the entire
state.snapshot into TxWalFile just to mark the WAL entry committed and
immediately deletes it; remove that unnecessary clone by constructing the
committed TxWalFile without the snapshot (or with None) before calling
write_transaction_wal, so you still pass tx_id and committed: true but avoid
using state.snapshot.clone(); update references in commit_transaction to stop
reading state.snapshot.clone() and ensure write_transaction_wal and TxWalFile
can accept a missing/None snapshot for committed entries, then keep
clear_transaction_wal() and resetting tx_state as-is.
- Around line 611-623: The write_transaction_wal function currently uses
fs::write which doesn't fsync; change it to open the WAL path with File::create
(using the same path from self.storage.wal_path()), write the
serde_json::to_vec(wal) bytes via write_all, then call sync_all on the file to
flush to disk (and map any IO/serde errors to StorageError::WriteError as
before); keep the existing parent directory creation logic and error mapping,
but replace the single fs::write call with the create/write_all/sync_all
sequence to guarantee WAL durability for TxWalFile.

In `@nexum_core/src/sql/parser.rs`:
- Around line 217-234: The parser currently recognizes single-word "commit" and
"rollback" but not the two-token variants "commit transaction" and "rollback
transaction"; update the token checks in nexum_core::sql::parser (the matching
block that returns Statement::BeginTransaction / Statement::CommitTransaction /
Statement::RollbackTransaction) to mirror the existing two-token handling used
for "begin transaction" by adding conditions that check tokens.len() == 2 &&
tokens[0].eq_ignore_ascii_case("commit") &&
tokens[1].eq_ignore_ascii_case("transaction") to return
Statement::CommitTransaction and likewise tokens.len() == 2 &&
tokens[0].eq_ignore_ascii_case("rollback") &&
tokens[1].eq_ignore_ascii_case("transaction") to return
Statement::RollbackTransaction so the parser accepts both single- and two-token
forms.

In `@nexum_core/src/storage/engine.rs`:
- Around line 13-18: The WAL file is currently created inside sled's managed
directory (wal_path: Some(db_path.join("nexum_tx_wal.json"))) which can conflict
with sled; change the WAL path in the constructor to be a sibling in the parent
directory (derive parent = db_path.parent() and set wal_path to
parent.join("nexum_tx_wal.json"), handling the case where parent is None by
falling back to db_path or returning an error) so the file is outside sled's
directory; update the code that reads/writes wal_path accordingly (referencing
wal_path and the constructor that returns Self { db, wal_path }) and add unit
tests validating scan_all() and delete_keys() behavior to catch regressions.

Signed-off-by: unknown <010samarjeet@gamil.com>

samarjeet818 · 2026-02-13T13:34:33Z

All checks are passing and there are no merge conflicts.
Please review the changes and let me know if any modifications are required.
Thanks!

aviralgarg05

LGTM!

samarjeet818 requested a review from aviralgarg05 as a code owner February 13, 2026 12:21

github-actions Bot added documentation Improvements or additions to documentation rust python tests ai executor cli storage sql size/XL labels Feb 13, 2026

feat: implement transaction support with WAL recovery

c9861e9

Signed-off-by: unknown <010samarjeet@gamil.com>

samarjeet818 force-pushed the feat/add-explain-query-plan branch from 4934a75 to c9861e9 Compare February 13, 2026 12:24

github-actions Bot added size/L and removed documentation Improvements or additions to documentation python ai labels Feb 13, 2026

coderabbitai Bot reviewed Feb 13, 2026

View reviewed changes

fix: address transaction review feedback from CodeRabbit

c0f585c

Signed-off-by: unknown <010samarjeet@gamil.com>

github-actions Bot added the documentation Improvements or additions to documentation label Feb 13, 2026

aviralgarg05 approved these changes Feb 13, 2026

View reviewed changes

aviralgarg05 merged commit 1713219 into aviralgarg05:main Feb 13, 2026
24 checks passed

github-actions Bot mentioned this pull request Feb 11, 2026

chore(main): release 0.7.0 #125

Open

aviralgarg05 added OSCG26 medium Intermediate difficulty hard Complex, requires deep understanding and removed medium Intermediate difficulty labels Feb 13, 2026

coderabbitai Bot mentioned this pull request Feb 15, 2026

test(sql): add quoted-identifier tests for DESCRIBE/DROP #141

Open

This was referenced Feb 15, 2026

perf(storage): add table row-count helper API #153

Open

test: add atomicity regression tests for UPDATE/DELETE failures #154

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add transaction support (BEGIN/COMMIT/ROLLBACK) with WAL recovery#137

feat: add transaction support (BEGIN/COMMIT/ROLLBACK) with WAL recovery#137
aviralgarg05 merged 2 commits into
aviralgarg05:mainfrom
samarjeet818:feat/add-explain-query-plan

samarjeet818 commented Feb 13, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Feb 13, 2026 •

edited

Loading

Rate limit exceeded

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

samarjeet818 commented Feb 13, 2026

Uh oh!

aviralgarg05 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

samarjeet818 commented Feb 13, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

samarjeet818 commented Feb 13, 2026

Uh oh!

aviralgarg05 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

samarjeet818 commented Feb 13, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Feb 13, 2026 •

edited

Loading