feat: Add parse option for char-based columns in Sourcepos by Martin005 · Pull Request #779 · kivikakk/comrak

Martin005 · 2026-03-24T10:09:13Z

This PR introduces a new parsing option to report source position columns as Unicode character counts instead of UTF-8 byte offsets.
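To illustrate the difference between the two column conventions, here is a minimal standalone sketch. It is not the code from this PR and it does not use comrak's API: `byte_col_to_char_col` is a hypothetical helper that assumes 1-based columns and a byte column that points at the first byte of a character.

```rust
/// Hypothetical helper (not part of comrak): convert a 1-based UTF-8 byte
/// column into a 1-based Unicode character column within a single line.
/// Returns `None` if the column is 0, lies past the end of the line, or
/// falls in the middle of a multi-byte character.
fn byte_col_to_char_col(line: &str, byte_col: usize) -> Option<usize> {
    // Columns are 1-based, so column 0 has no valid byte index.
    let byte_idx = byte_col.checked_sub(1)?;
    // `is_char_boundary` is false for indices past the end of the string
    // and for indices inside a multi-byte character.
    if !line.is_char_boundary(byte_idx) {
        return None;
    }
    // Count the characters before the target and convert back to 1-based.
    Some(line[..byte_idx].chars().count() + 1)
}

fn main() {
    let line = "héllo *wörld*";
    // 'é' occupies two bytes, so the first '*' sits at byte column 8
    // but at character column 7.
    let byte_col = line.find('*').unwrap() + 1;
    let char_col = byte_col_to_char_col(line, byte_col).unwrap();
    assert_eq!((byte_col, char_col), (8, 7));
    println!("byte column {byte_col}, char column {char_col}");
}
```

For pure ASCII lines the two columns coincide; they diverge only after the first multi-byte character on a line.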

I created a lot of tests for different node values / extensions, but the tests are not exhaustive. Please let me know if I should cover more things 🙂

Closes #777

codspeed-hq · 2026-03-24T10:12:32Z

Merging this PR will not alter performance

✅ 1 untouched benchmark

Comparing Martin005:sourcepos_chars (583d092) with main (0d4a9ca)¹

¹ No successful run was found on main (583d092) during the generation of this report, so 0d4a9ca was used instead as the comparison base. There might be some changes unrelated to this pull request in this report.

kivikakk

Thank you, this looks great! 🤍

kivikakk · 2026-03-29T00:39:05Z

src/parser/mod.rs

+        if lc.column == 0 {
+            return;
+        }
+        if let Some(line) = lines.get(lc.line.wrapping_sub(1)) {


kivikakk · 2026-03-29

wrapping_sub cannot produce a useful result if lc.line is zero here (it wraps to 18446744073709551615 on 64-bit targets); it would make more sense to give an underflow error than an out-of-bounds one.

Martin005 · 2026-03-29

Oh, that's right, thanks for noticing. Should I cover that in another PR? 🙂

kivikakk · 2026-03-30

I generally treat such non-blocking comments as: feel free to fix it if you like, whether in its own PR or in whatever one you happen to do next; and maybe I will, if I happen to write a PR myself sometime soon and remember :)
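The review point above can be sketched in isolation. This is a hedged illustration, not the fix that was or will be applied to src/parser/mod.rs: `get_line` and `LineLookupError` are hypothetical names, and the sketch assumes 1-based line numbers looked up in a slice of lines.

```rust
/// Hypothetical error type for this sketch (not part of comrak).
#[derive(Debug, PartialEq)]
enum LineLookupError {
    /// The 1-based line number was 0, so subtracting 1 would underflow.
    ZeroLine,
    /// The line number is valid but lies past the end of the input.
    OutOfBounds { line: usize, len: usize },
}

/// Look up a 1-based line number, reporting underflow separately from
/// an out-of-bounds index instead of letting `wrapping_sub` merge them.
fn get_line<'a>(lines: &[&'a str], line: usize) -> Result<&'a str, LineLookupError> {
    // `checked_sub` returns `None` for 0 instead of wrapping to `usize::MAX`.
    let idx = line.checked_sub(1).ok_or(LineLookupError::ZeroLine)?;
    lines
        .get(idx)
        .copied()
        .ok_or(LineLookupError::OutOfBounds { line, len: lines.len() })
}

fn main() {
    let lines = ["first", "second"];

    // With `wrapping_sub`, line 0 becomes `usize::MAX`, and the lookup
    // fails only because that index happens to be out of bounds.
    assert_eq!(0usize.wrapping_sub(1), usize::MAX);
    assert!(lines.get(0usize.wrapping_sub(1)).is_none());

    // With `checked_sub`, the two failure modes are distinguishable.
    assert_eq!(get_line(&lines, 0), Err(LineLookupError::ZeroLine));
    assert_eq!(get_line(&lines, 1), Ok("first"));
    assert_eq!(
        get_line(&lines, 3),
        Err(LineLookupError::OutOfBounds { line: 3, len: 2 })
    );
}
```

Whether a zero line number should be an error, a panic, or a silent early return (as the diff already does for a zero column) is a design choice for the maintainers; the sketch only shows how `checked_sub` keeps the underflow case visible.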

kivikakk · 2026-03-29T00:46:39Z

> I created a lot of tests for different node values / extensions, but the tests are not exhaustive. Please let me know if I should cover more things 🙂

Your tests are always super exhaustive to my eyes; I really appreciate your efforts!

Martin005 · 2026-03-29T07:17:27Z

> > I created a lot of tests for different node values / extensions, but the tests are not exhaustive. Please let me know if I should cover more things 🙂
>
> Your tests are always super exhaustive to my eyes; I really appreciate your efforts!

By "not exhaustive", I meant that the test cases don't cover every single node value / extension, and there isn't a case for a long Markdown document with multiple node values.
But I don't think there should be a situation where sourcepos_chars gives an incorrect value, as the implementation is very straightforward (famous last words 😀)

Martin005 changed the title ~~feat: Add support for char-based columns in Sourcepos~~ feat: Add parse option for char-based columns in Sourcepos Mar 24, 2026

kivikakk approved these changes Mar 29, 2026


Martin005 added 3 commits March 29, 2026 11:43

feat: Add support for char-based columns in Sourcepos

996f68a

feat: Add conditional assertion for shortcodes in AstMatchTree

0cd1578

feat: Add sourcepos_chars option to FuzzParseOptions

583d092

kivikakk force-pushed the sourcepos_chars branch from 056a8f9 to 583d092 Compare March 29, 2026 00:43

kivikakk enabled auto-merge March 29, 2026 00:43

kivikakk merged commit 79cd65d into kivikakk:main Mar 29, 2026
26 checks passed

Martin005 deleted the sourcepos_chars branch March 29, 2026 06:53

kivikakk merged 3 commits into kivikakk:main from Martin005:sourcepos_chars
