Improved dbcs file api handling by 1000TurquoisePogs · Pull Request #592 · zowe/zowe-common-c

1000TurquoisePogs · 2026-04-23T15:21:55Z

Proposed changes

Fix the default target encoding used when serving USS file content via respondWithUnixFile2() in httpserver.c. Previously, on z/OS the target was hardcoded to ISO-8859-1 (819) for all text-type files regardless of their tagged CCSID. This corrupted content from files tagged as UTF-8 (1208), UTF-16 (1200/1201/1202), or any EBCDIC MIX code page (930, 933, 935, 937, 939, 1364, 1388, 1390, 1399).

The fix introduces isMultiByteCCSID(int ccsid) in charsets.c/charsets.h and uses it at runtime to select the target:

Single-byte source (e.g. IBM-1047, ISO-8859-1) → target remains ISO-8859-1 (819), no behaviour change.
Multi-byte source (UTF-8, UTF-16, EBCDIC MIX) → target is now UTF-8 (1208).

The OS-based #ifdef TBD comment is replaced with this runtime selection on z/OS. Non-z/OS platforms (Linux, AIX, Windows) continue to use UTF-8 as before.

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)

PR Checklist

If the changes in this PR are meant for the next release / mainline, this PR targets the "staging" branch.
My code follows the style guidelines of this project (see: Contributing guideline)
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
New and existing unit tests pass locally with my changes
Relevant update to CHANGELOG.md
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works, or describe a test method below

Testing

Manual test — UTF-8 tagged file:

On z/OS, create a USS file containing UTF-8 encoded text (e.g. a file with Japanese or Chinese characters).
Tag it: chtag -tc 1208 <file>
GET /unixFileContents/<path> with no query parameters.
Verify the response bytes are valid UTF-8 and the non-Latin-1 characters are present and correct.
Before this fix, non-Latin-1 bytes would be corrupted or replaced.

Regression — IBM-1047 tagged file:

Create or use an existing USS file tagged CCSID 1047.
GET /unixFileContents/<path>.
Verify the response is the same ISO-8859-1 content as before this change.

Regression — untagged file:

Create or use an untagged USS file (chtag -b <file> to remove tag, or leave untagged).
GET /unixFileContents/<path>.
Verify the response is NATIVE_CODEPAGE (1047) → ISO-8859-1 converted output, same as before.

Unit test — isMultiByteCCSID():

Input	Expected
1208 (UTF-8)	TRUE
1200 (UTF-16)	TRUE
1201 (UTF-16BE)	TRUE
1202 (UTF-16LE)	TRUE
930 (EBCDIC MIX Japanese)	TRUE
933, 935, 937, 939	TRUE
1364, 1388, 1390, 1399	TRUE
1047 (IBM-1047)	FALSE
819 (ISO-8859-1)	FALSE
37 (IBM-037)	FALSE
0 (untagged)	FALSE
-1 / 65535 (binary)	FALSE

Further comments

The original code contained an explicit TBD comment acknowledging that the OS-based selection "isn't really an OS dependency". This PR resolves that TBD by selecting the target encoding at runtime based on isMultiByteCCSID().

The isMultiByteCCSID() function is intentionally conservative: it covers the Unicode encodings and the EBCDIC MIX (SBCS+DBCS) code pages that are realistic as USS file tags. Pure DBCS-only pages (300, 834, 835, 837) are excluded because they require SO/SI byte handling and are unlikely to appear as file-level CCSID tags in practice. The set can be extended in a follow-up if needed.

Signed-off-by: 1000TurquoisePogs <sgrady@rocketsoftware.com>

sonarqubecloud · 2026-04-23T15:23:49Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

Improved dbcs file api handling

29cb413

Signed-off-by: 1000TurquoisePogs <sgrady@rocketsoftware.com>

github-project-automation Bot added this to zOS Squad Board Apr 23, 2026

Add pr numbers

42f618e

Signed-off-by: 1000TurquoisePogs <sgrady@rocketsoftware.com>

1000TurquoisePogs mentioned this pull request Apr 23, 2026

Update file api encoding behavior based on user preference #593

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improved dbcs file api handling#592

Improved dbcs file api handling#592
1000TurquoisePogs wants to merge 2 commits into
v3.x/stagingfrom
feature/v3/dbcs-fileapi

1000TurquoisePogs commented Apr 23, 2026

Uh oh!

sonarqubecloud Bot commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

1000TurquoisePogs commented Apr 23, 2026

Proposed changes

Type of change

PR Checklist

Testing

Further comments

Uh oh!

sonarqubecloud Bot commented Apr 23, 2026

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant