Commit 53dd595
fix(#1442): scan_*_references reads raw JSON metadata, not decoded codec output
scan_hash_references and scan_schema_references called table.to_arrays(attr_name),
which routes through decode_attribute and returns the codec's decoded payload
(numpy.ndarray, NpyRef, ObjectRef, bytes, or local path str). None of those
satisfy _extract_*_refs's `isinstance(value, dict) and "path" in value` check,
so both helpers silently returned empty reference sets and gc.collect() would
have classified live data as orphaned.
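
The failure mode described above can be reproduced in isolation. The sketch below applies only the check quoted in the message to a few stand-in decoded values; `has_path_ref` and `FakeDecodedRef` are hypothetical names for illustration and are not part of gc.py, and a plain list stands in for numpy.ndarray.

```python
def has_path_ref(value):
    # The check quoted in the commit message, applied to a single value.
    return isinstance(value, dict) and "path" in value


class FakeDecodedRef:
    """Hypothetical stand-in for a decoded object such as NpyRef or ObjectRef."""

    def __init__(self, path):
        self.path = path


# Values shaped like what a codec returns after decoding.
decoded_values = [
    b"\x00\x01",                       # bytes payload
    "/tmp/cache/blob.bin",             # local path str
    [1.0, 2.0, 3.0],                   # stand-in for an ndarray
    FakeDecodedRef("schema/t/a.npy"),  # stand-in for a reference object
]

# Value shaped like the raw JSON metadata stored in the column.
raw_metadata = {"path": "schema/t/a.npy", "size": 128}

decoded_hits = [has_path_ref(v) for v in decoded_values]
raw_hit = has_path_ref(raw_metadata)

print(decoded_hits)  # [False, False, False, False]
print(raw_hit)       # True
```

No decoded value passes the check, so a scan built on decoded output collects nothing, while the raw metadata dict passes.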
Replace with table.proj(attr_name).cursor(as_dict=True). The cursor yields
the raw JSON column value: a dict on PostgreSQL/JSONB or a JSON string on
MySQL — both already handled by _extract_*_refs (gc.py:138 string branch,
gc.py:145 dict branch). Backend-agnostic, custom-codec-safe, and turns scan
into a metadata-only operation (no more downloading every external blob to
discard the deserialized result).
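
A minimal sketch of the backend-agnostic handling described above, assuming the raw column value arrives either as a dict or as a JSON string. `extract_path_refs` and `scan_rows` are hypothetical helpers, not the actual `_extract_*_refs` code, and the plain lists of dicts stand in for rows yielded by table.proj(attr_name).cursor(as_dict=True).

```python
import json


def extract_path_refs(value):
    """Return the set of paths referenced by one raw JSON column value."""
    if isinstance(value, str):
        # String branch: the backend returned JSON as text.
        try:
            value = json.loads(value)
        except ValueError:
            return set()
    if isinstance(value, dict) and "path" in value:
        # Dict branch: the backend returned an already-parsed object.
        return {value["path"]}
    return set()


def scan_rows(rows, attr_name):
    """Union the references found in attr_name across dict-shaped rows."""
    refs = set()
    for row in rows:
        refs |= extract_path_refs(row.get(attr_name))
    return refs


# Hypothetical rows: dict values (JSONB-style) and string values (MySQL-style).
dict_rows = [{"data": {"path": "store/a1"}}, {"data": {"path": "store/b2"}}]
string_rows = [{"data": '{"path": "store/a1"}'}, {"data": None}]

dict_refs = scan_rows(dict_rows, "data")
string_refs = scan_rows(string_rows, "data")
print(sorted(dict_refs))    # ['store/a1', 'store/b2']
print(sorted(string_refs))  # ['store/a1']
```

Because only the metadata is inspected, nothing in this path needs to fetch or deserialize the external blob itself.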
Also registers gc in _lazy_modules (src/datajoint/__init__.py) so
dj.gc.scan(...) works as documented in the gc module docstring and in
how-to/garbage-collection.md, matching the existing diagram entry pattern.
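
For readers unfamiliar with lazy submodule registration, here is a generic sketch of the pattern using a module-level `__getattr__` (PEP 562). It is an assumption about the general technique, not a copy of src/datajoint/__init__.py: the package `fakepkg`, the attribute name `gc_like`, and the mapping contents are hypothetical, and the standard-library `json` module stands in for the lazily loaded submodule.

```python
import importlib
import types

# Hypothetical registry: attribute name -> module imported on first access.
_lazy_modules = {"gc_like": "json"}

# Stand-in for a package module object.
pkg = types.ModuleType("fakepkg")


def _lazy_getattr(name):
    """Import a registered module on first attribute access and cache it."""
    if name in _lazy_modules:
        module = importlib.import_module(_lazy_modules[name])
        setattr(pkg, name, module)  # later lookups bypass this hook
        return module
    raise AttributeError(f"module 'fakepkg' has no attribute {name!r}")


# A __getattr__ entry in a module's namespace is consulted when normal
# attribute lookup on that module fails.
pkg.__getattr__ = _lazy_getattr

loaded_before = "gc_like" in vars(pkg)
result = pkg.gc_like.dumps({"ok": True})  # triggers the import
loaded_after = "gc_like" in vars(pkg)

print(loaded_before, loaded_after)  # False True
print(result)                       # {"ok": true}
```

An attribute that is not in the registry still raises AttributeError, so typos are not silently swallowed.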
Test scaffolding adds TestScanWithLiveData with three non-mocked e2e tests
covering structurally distinct decoded-value types (<blob@> → ndarray,
<npy@> → NpyRef, <object@> → ObjectRef). The same tests fail on the buggy
version, proving the regression. The 26 existing mocked tests stay intact
for orchestration coverage.
GC remains non-transaction-safe (TOCTOU window between scan and delete);
a two-phase quarantine→grace→purge API is the right remedy and will be
tracked as a separate enhancement.

1 parent dfbcfd8 commit 53dd595
3 files changed
Lines changed: 135 additions & 8 deletions