Build: #1 failed First build for this plan

Stages & jobs

  1. Default Stage

  2. Test

Build result summary

Details

Completed
Queue duration
30 seconds
Duration
33 minutes
Labels
None
Revision
826c31e116218750e0d4bd9668df49796f903cfe
Fixed in
#2 (Changes by Rui Xue <rx.astro@gmail.com>)
No failed test found. A possible compilation error occurred.

Responsible

No one has taken responsibility for this failure

Code commits

Author Commit Message Commit date
Rui Xue <rx.astro@gmail.com> Rui Xue <rx.astro@gmail.com> 826c31e116218750e0d4bd9668df49796f903cfe CAS-14756: Align image tile shape to MPI channel boundaries and disable explicit file locks
The default casacore TiledShape could produce tiles that are
small spatially but span many spectral channels. When mpicasa workers
write to disjoint channel ranges of shared .residual/.psf/.sumwt images,
their writes collide on the same physical disk tiles, causing Lustre
table.lock contention that serializes all I/O (observed: ~22.5h on a
29,861-channel dataset).

SIImageStore::buildImage():
- When chnchnks > 1 in itsMiscInfo, override PagedImage tile shape with
  TiledShape(min(Nx,256), min(Ny,256), Npol, Nchan/chnchnks), ensuring
  each MPI worker's channel range maps to independent tile blocks.
- Serial mode (chnchnks absent or 1) falls back to the original
  PagedImage(shape, csys, name) constructor with casacore defaults.

CubeMajorCycleAlgorithm:
- subImageStore(): disable sumwt.lock(FileLocker::Write) — sumwt is only
  read for its shape, never written here.
- getSubImage(): disable im->lock(FileLocker::Read) and im->unlock() —
  data is copied into a TempImage; no persistent write occurs.
- writeBackToFullImage(): disable LatticeLocker and im->unlock() —
  workers write non-overlapping SubImage channel slices; tile alignment
  guarantees no shared disk blocks.

Anticipate no metadata race risk: workers never call setImageInfo/setMiscInfo on
disk images. All normalization operates on in-memory TempImage copies.
Beam sets are serialized via MPI returnRec and merged by Rank 0 after
all workers complete.

Jira issues

IssueDescriptionStatus
Unknown Issue TypeCAS-14756Could not obtain issue details from Jira

Shared artifacts

Artifact File size
ManyLinux228 Casatestutils 144 KB

Error summary for ManyLinux 2.28

The job generated some errors, drill down into the full build log for more details.

error: pathspec 'CAS-14756' did not match any file(s) known to git.
Checking out files:  25% (1657/6536)   
Checking out files:  26% (1700/6536)   
Checking out files:  27% (1765/6536)   
Checking out files:  28% (1831/6536)   
Checking out files:  29% (1896/6536)   
Checking out files:  30% (1961/6536)   
Checking out files:  31% (2027/6536)   
Checking out files:  32% (2092/6536)   
Checking out files:  33% (2157/6536)   
Checking out files:  34% (2223/6536)   
Checking out files:  35% (2288/6536)   
Checking out files:  36% (2353/6536)   
Checking out files:  37% (2419/6536)   
Checking out files:  38% (2484/6536)   
Checking out files:  39% (2550/6536)   
Checking out files:  40% (2615/6536)   
Checking out files:  41% (2680/6536)   
Checking out files:  42% (2746/6536)   
Checking out files:  43% (2811/6536)   
Checking out files:  44% (2876/6536)   
Checking out files:  45% (2942/6536)   
Checking out files:  46% (3007/6536)   
Checking out files:  47% (3072/6536)   
Checking out files:  48% (3138/6536)   
Checking out files:  49% (3203/6536)   
Checking out files:  50% (3268/6536)   
Checking out files:  51% (3334/6536)   
Checking out files:  52% (3399/6536)   
Checking out files:  53% (3465/6536)   
Checking out files:  54% (3530/6536)   
Checking out files:  55% (3595/6536)   
Checking out files:  56% (3661/6536)   
Checking out files:  56% (3677/6536)   
Checking out files:  57% (3726/6536)   
Checking out files:  58% (3791/6536)   
Checking out files:  59% (3857/6536)   
Checking out files:  60% (3922/6536)   
Checking out files:  61% (3987/6536)   
Checking out files:  62% (4053/6536)   
Checking out files:  63% (4118/6536)   
Checking out files:  64% (4184/6536)   
Checking out files:  65% (4249/6536)   
Checking out files:  66% (4314/6536)   
Checking out files:  67% (4380/6536)   
Checking out files:  68% (4445/6536)   
Checking out files:  69% (4510/6536)   
Checking out files:  70% (4576/6536)   
Checking out files:  71% (4641/6536)   
Checking out files:  72% (4706/6536)   
Checking out files:  73% (4772/6536)   
Checking out files:  74% (4837/6536)   
Checking out files:  75% (4902/6536)   
Checking out files:  76% (4968/6536)   
Checking out files:  77% (5033/6536)   
Checking out files:  78% (5099/6536)   
Checking out files:  79% (5164/6536)   
Checking out files:  80% (5229/6536)   
Checking out files:  81% (5295/6536)   
Checking out files:  82% (5360/6536)   
Checking out files:  82% (5392/6536)   
Checking out files:  83% (5425/6536)   
Checking out files:  84% (5491/6536)   
Checking out files:  85% (5556/6536)   
Checking out files:  86% (5621/6536)   
Checking out files:  87% (5687/6536)   
Checking out files:  88% (5752/6536)   
Checking out files:  89% (5818/6536)   
Checking out files:  90% (5883/6536)   
Checking out files:  91% (5948/6536)   
Checking out files:  92% (6014/6536)   
Checking out files:  93% (6079/6536)   
Checking out files:  94% (6144/6536)   
Checking out files:  95% (6210/6536)   
Checking out files:  96% (6275/6536)   
Checking out files:  97% (6340/6536)   
Checking out files:  98% (6406/6536)   
Checking out files:  99% (6471/6536)   
Checking out files: 100% (6536/6536)   
Checking out files: 100% (6536/6536), done.
Switched to a new branch 'CAS-14756'
error: pathspec 'CAS-14756' did not match any file(s) known to git.
Error response from daemon: No such container: casalith_container
Error response from daemon: No such container: casalith_container
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed

  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
43 6002k   43 2609k    0     0  6416k      0 --:--:-- --:--:-- --:--:-- 6474k
100 6002k  100 6002k    0     0  13.1M      0 --:--:-- --:--:-- --:--:-- 13.2M
[23902] Failed to execute script 'atlutil' due to unhandled exception!
Traceback (most recent call last):
  File "atlutil.py", line 200, in <module>
  File "atlutil.py", line 165, in has_fix_version
  File "json/__init__.py", line 354, in loads
  File "json/decoder.py", line 339, in decode
  File "json/decoder.py", line 357, in raw_decode
json.decoder.JSONDecodeError: Expecting value: line 12 column 1 (char 11)
ls: cannot access *.whl: No such file or directory
error: pathspec 'CAS-14756' did not match any file(s) known to git.
Unable to find executable: /source/casa6/casatools/src/code/build/calanalysis/tCalStats0
Unable to find executable: /source/casa6/casatools/src/code/build/synthesis/tCalLibraryParse
Unable to find executable: /source/casa6/casatools/src/code/build/synthesis/tCTIter
Unable to find executable: /source/casa6/casatools/src/code/build/synthesis/tGridFT
Unable to find executable: /source/casa6/casatools/src/code/build/synthesis/tAWPHPG
Unable to find executable: /source/casa6/casatools/src/code/build/synthesis/tVisModelDataRefim
Unable to find executable: /source/casa6/casatools/src/code/build/synthesis/tFFT2D
Errors while running CTest
Error response from daemon: No such container: build_container
Error response from daemon: No such container: build_container