Bug report: Unescape Unicode Characters only accepts exactly 4 hex digits for U+ #2242

**Describe the bug**
The `Unescape Unicode Characters` operation fails to decode valid code points with more than 4 hex digits when using the `U+` prefix, breaking support for astral plane characters like emoji.

Location: `src/core/operations/UnescapeUnicodeCharacters.mjs`, `run()` method, line 55.

```javascript
run(input, args) {
    const prefix = prefixToRegex[args[0]],
        regex = new RegExp(prefix+"([a-f\\d]{4})", "ig");
    // ...
}
```

The quantifier is hardcoded to exactly 4 hex digits for all prefixes, so notation like `U+1F600` (😀) and `U+000041` (zero-padded A) cannot be matched in full. It also breaks round-trips: `Escape Unicode Characters` can emit 6-digit output such as `U+000041` when configured with `Padding: 6`, but `Unescape Unicode Characters` cannot decode it.
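A quick way to see the limitation outside CyberChef is to run the same pattern on the two inputs. This is a minimal sketch: the `probe` helper is hypothetical, and the prefix string `U\\+` is an assumption about what `prefixToRegex` maps `U+` to.

```javascript
// Assumption: prefixToRegex["U+"] is the escaped string "U\\+".
const prefix = "U\\+";

// Hypothetical helper: report what the fixed 4-digit pattern captures
// and which part of the input it leaves behind.
function probe(text) {
    const regex = new RegExp(prefix + "([a-f\\d]{4})", "ig");
    const m = regex.exec(text);
    if (m === null) return { captured: null, leftover: text };
    return { captured: m[1], leftover: text.slice(m.index + m[0].length) };
}

console.log(probe("U+1F600"));  // captured "1F60", leftover "0"
console.log(probe("U+000041")); // captured "0000", leftover "41"
```

In both cases the capture group stops after four digits, so the decoded value cannot be the intended code point.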


**To Reproduce**
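Add `Unescape Unicode Characters` with prefix `U+` and enter the input `U+1F600`. Expected: `😀`. Actual: `😀` is not produced; the regex shown above can consume at most the first four digits (`1F60`), leaving the trailing `0` undecoded.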

**Screenshots**

<img width="504" height="266" alt="Image" src="https://github.com/user-attachments/assets/7863ea48-4b5d-4596-9a53-7791ee07e351" />

<img width="502" height="355" alt="Image" src="https://github.com/user-attachments/assets/9f6d524a-dfce-45a2-a774-62d74f528173" />


**Additional context**
Proposed fix widens the quantifier for `U+` only:

```javascript
run(input, args) {
    const prefix = prefixToRegex[args[0]],
        regex = args[0] === "U+"
            ? new RegExp(prefix+"([a-f\\d]{4,6})", "ig")
            : new RegExp(prefix+"([a-f\\d]{4})", "ig");
    // ...
}
```

Standard `U+` notation allows variable-length hex sequences from 4 to 6 digits (code points run from `U+0000` to `U+10FFFF`). The `\u` and `%u` forms are fixed-width: they expect exactly 4 digits and represent astral characters as surrogate pairs, so they keep the fixed-length requirement.
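The widened quantifier can be tried in isolation with a small standalone decoder. This is only a sketch under stated assumptions: `unescapeUPlus` is a hypothetical helper, not CyberChef's `run()` method, and it uses the built-in `String.fromCodePoint` rather than whatever character conversion the operation uses internally.

```javascript
// Hypothetical standalone helper, not CyberChef's actual run() method.
function unescapeUPlus(input) {
    // 4 to 6 hex digits; the quantifier is greedy, so it takes the
    // longest run of hex digits available (up to 6).
    const regex = /U\+([a-f\d]{4,6})/ig;
    return input.replace(regex, (match, hex) => {
        const codePoint = parseInt(hex, 16);
        // Leave values above the Unicode maximum (U+10FFFF) untouched.
        if (codePoint > 0x10FFFF) return match;
        return String.fromCodePoint(codePoint);
    });
}

console.log(unescapeUPlus("U+1F600"));  // 😀
console.log(unescapeUPlus("U+000041")); // A
console.log(unescapeUPlus("U+0041"));   // A
```

One consequence of greedy matching worth checking before merging: a 4-digit escape followed directly by hex characters, such as `U+0041BC`, is read as the single code point `U+0041BC` rather than `A` followed by `BC`.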

