Explicit conflict marker detection by koppor · Pull Request #629 · JabRef/jabref-koppor

koppor · 2022-09-26T18:54:27Z

WIP, because the parsing architecture is a bit complicated here.

We cannot "just" read the whole file, because it could be very slow when reading large data bases.

Co-authored-by: Christoph <siedlerkiller@gmail.com> Co-authored-by: Carl Christian Snethlage <50491877+calixtus@users.noreply.github.com> Co-authored-by: Houssem Nasri <housi.housi2015@gmail.com>

Co-authored-by: Christoph <siedlerkiller@gmail.com> Co-authored-by: Carl Christian Snethlage <50491877+calixtus@users.noreply.github.com> Co-authored-by: Houssem Nasri <housi.housi2015@gmail.com> Co-authored-by: Benedikt Tutzer <btut@users.noreply.github.com>

koppor · 2022-10-11T18:25:15Z

Two implementation ideas:

Dive into the grammar and add "exception" paths for conflict markers
Do a pre-reading of the file and check for conflict markers
Spawn a parallel thread checking for conflict markers. If the normal reading thread returns, with an error, check what the checking thread said.

Think, we could go for 2 even though file loading might be slower?! -- https://www.amitph.com/java-read-write-large-files-efficiently/

HoussemNasri · 2022-10-23T22:45:33Z

+
+        ParserResult expected = ParserResult.fromErrorMessage("Found git conflict markers");
+
+        assertEquals(expected, parserResult);


I got the test passing by checking for git markers inside the read method after consuming a newline character.
I also called checkForGitMarkers at the beginning of the file to ensure it's called on the first line.

private int read() throws IOException { int character = pushbackReader.read(); if (!isEOFCharacter(character)) { pureTextFromFile.offerLast((char) character); } if (character == '\n') { line++; checkForGitConflictMarker(); } return character; }

This is the logic. It looks for a line that starts with the 'ours' marker, which is represented by the symbol <<<<<<<. Then it continues to skip lines until it reaches the 'theirs' marker >>>>>>>.

private void checkForGitConflictMarker() throws IOException { skipSpace(); int markerCount = 0; // Looking for the 'ours' marker char c; while ((c = (char) peek()) == '<' && !isEOFCharacter(c)) { read(); markerCount++; } if (markerCount == 7) { parserResult.addWarning("Found git conflict markers at line %d".formatted(line)); // Skip 'ours' marker <<<<<<< skipLine(); // Keep skipping lines until we hit the beginning of 'theirs' marker >>>>>>> while (peek() != '>' && !isEOFCharacter(peek())) { skipLine(); } // Skip 'theirs' marker if we haven't hit EOF already if (!isEOFCharacter(peek())) { skipLine(); } } } private void skipLine() throws IOException { while (peek() != '\n' && !isEOFCharacter(peek())) { read(); } skipOneNewline(); }

I had to modify the test slightly to pass because the logic I used would continue parsing after the marker, resulting in a parser result with two entries when the expected parser result is zero. I changed it to check whether the warning list contains the git conflict warning.

assertTrue(parserResult.warnings().contains("Found git conflict markers at line 3"));

Where is the commit? 🎉👀

I didn't make a commit. I got the inspiration of the solution while working on another PR, so I just made the changes on that PR's branch. However, you can use the code above inside BibTeXParser, and don't forget to set co-authored with me 😁.

koppor and others added 2 commits September 26, 2022 20:39

Rename "getErrorMessage()" to "getWarningsAsString()"

35b4185

Co-authored-by: Christoph <siedlerkiller@gmail.com> Co-authored-by: Carl Christian Snethlage <50491877+calixtus@users.noreply.github.com> Co-authored-by: Houssem Nasri <housi.housi2015@gmail.com>

HoussemNasri reviewed Oct 23, 2022

View reviewed changes

koppor force-pushed the main branch from 9799dc4 to d6a0e59 Compare October 27, 2023 01:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Explicit conflict marker detection#629

Explicit conflict marker detection#629
koppor wants to merge 2 commits intomainfrom
fix-9167

koppor commented Sep 26, 2022

Uh oh!

koppor commented Oct 11, 2022

Uh oh!

HoussemNasri Oct 23, 2022

Uh oh!

koppor Nov 2, 2022

Uh oh!

HoussemNasri Nov 2, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		ParserResult expected = ParserResult.fromErrorMessage("Found git conflict markers");

		assertEquals(expected, parserResult);

Conversation

koppor commented Sep 26, 2022

Uh oh!

koppor commented Oct 11, 2022

Uh oh!

HoussemNasri Oct 23, 2022

Choose a reason for hiding this comment

Uh oh!

koppor Nov 2, 2022

Choose a reason for hiding this comment

Uh oh!

HoussemNasri Nov 2, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants