Build: #7 did not complete Manual run by CASA adm account

Build result summary

Details

Queued
Started
Completed
Status
Reason for not building is unknown
Labels
None
Revisions
Casa6
4b3bfca414918261df2401ecd4274572d83dbf8f
OPEN-CASA-PKG
e5285e68467f01199807a1f1a978aec8a4781be7

Comments on Build and package macOS 15 Python 3.12

  1. Anonymous user

    Detected hung build state. The build was detected as hung because the logs were quiet for "300" minutes and the build took "5.0" times longer than average. Attempting to generate stack trace and terminate spawned sub-processes.

Code commits

Casa6
Author Commit Message Commit date
Rui Xue <rx.astro@gmail.com> Rui Xue <rx.astro@gmail.com> 4b3bfca414918261df2401ecd4274572d83dbf8f CAS-14756: use AutoNoReadLocking(inspectInterval=1s) in writeBackToFullImage
NoLocking was allowing concurrent table.dat_tmp→table.dat renames to race
across MPI workers, causing "RegularFile::move error: No such file or
directory" in rank N's PagedImage destructor (job on cvpost124, build-5).

casacore's table.dat metadata file is always shared — even when tile data
blocks are strictly disjoint. NoLocking is therefore not safe for concurrent
writers.

AutoNoReadLocking with the default inspectInterval=0 is safe (the write lock
is held from first putSlice to flush), but autoRelease() becomes a no-op and
the entire writeback is serialized across workers — identical throughput to
UserLocking.

Setting inspectInterval=1.0 activates periodic lock inspection: every ~1 s
autoRelease() checks whether another worker wants the lock; if so it flushes
table.dat and releases, then the waiting worker acquires. Tile data I/O
interleaves across workers (disjoint blocks → no contention); only the brief
table.dat rename is serialized per transfer.

Jira issues

IssueDescriptionStatus
Unknown Issue TypeCAS-14756Could not obtain issue details from Jira