Build: #69 failed
Job: Test ManyLinux 2.28 Python 3.12 failed
mg2 20170525142607 180419 procedure hsdn calimage regression: Test case result
The below summarizes the result of the test " mg2 20170525142607 180419 procedure hsdn calimage regression" in build 69 of PIPESPECS - Test Pipeline main with Casa master - Test ManyLinux 2.28 Python 3.12.
- Description
- mg2 20170525142607 180419 procedure hsdn calimage regression
- Test class
- tests.regression.fast.nobeyama_sd_fast_test
- Method
- test_mg2_20170525142607_180419__procedure_hsdn_calimage__regression
- Duration
- 43 mins
- Status
- Failed (New Failure)
Error Log
Failed: Failed to match 2 result values within tolerances :
s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_0.qa.metric.score_sd_image_sensitivity_ratio
values differ by > a relative difference of 1e-07
expected: 1.427446433699322
new: 1.4275466845286737
diff: -0.00010025082935172236
percent_diff: -0.00702308871177153%
s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_1.qa.metric.score_sd_image_sensitivity_ratio
values differ by > a relative difference of 1e-07
expected: 1.3625947414487243
new: 1.361980167115621
diff: 0.0006145743331031905
percent_diff: 0.04510323681785011%
Worst absolute diff, s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_1.qa.metric.score_sd_image_sensitivity_ratio: 0.0006145743331031905
Worst percentage diff, s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_1.qa.metric.score_sd_image_sensitivity_ratio: 0.04510323681785011%
def test_mg2_20170525142607_180419__procedure_hsdn_calimage__regression():
"""Run ALMA single-dish cal+image regression for standard nobeyama recipe.
Recipe name: procedure_hsdn_calimage
Dataset: mg2-20170525142607-180419
"""
ref_directory = 'pl-regressiontest/mg2-20170525142607-180419'
pt = PipelineTester(
visname=['mg2-20170525142607-180419.ms'],
recipe='procedure_hsdn_calimage.xml',
input_dir=ref_directory,
expectedoutput_dir=ref_directory,
)
> pt.run()
tests/regression/fast/nobeyama_sd_fast_test.py:21:
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
tests/testing_utils.py:376: in run
self.__compare_results(new_file, default_relative_tolerance)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
self = <tests.testing_utils.PipelineTester object at 0x7f8afa4395e0>
new_file = 'mg2-20170525142607-180419.ms.NEW.results.txt'
relative_tolerance = 1e-07
def __compare_results(self, new_file: str, relative_tolerance: float) -> None:
"""
Compare results between new one loaded from file and old one.
Args:
new_file : file path of new results
relative_tolerance : relative tolerance of output value
"""
with open(self.expectedoutput_file) as expected_fd, open(new_file) as new_fd:
expected_results = expected_fd.readlines()
new_results = new_fd.readlines()
errors = []
worst_diff = (0, 0)
worst_percent_diff = (0, 0)
for old, new in zip(expected_results, new_results):
try:
oldkey, oldval, tol = self.__sanitize_results_string(old)
newkey, newval, _ = self.__sanitize_results_string(new)
except ValueError as e:
errorstr = "The results: {0} could not be parsed. Error: {1}".format(new, str(e))
errors.append(errorstr)
continue
assert oldkey == newkey, f"Expected key {oldkey} does not match new key {newkey}."
tolerance = tol if tol else relative_tolerance
if newval is not None:
LOG.info('Comparing %s to %s with a rel. tolerance of %s', oldval, newval, tolerance)
if oldval != pytest.approx(newval, rel=tolerance):
diff = oldval-newval
percent_diff = (oldval-newval)/oldval * 100 if oldval != 0 else 100
if abs(diff) > abs(worst_diff[0]):
worst_diff = diff, oldkey
if abs(percent_diff) > abs(worst_percent_diff[0]):
worst_percent_diff = percent_diff, oldkey
errorstr = f"{oldkey}\n\tvalues differ by > a relative difference of {tolerance}\n\texpected: {oldval}\n\tnew: {newval}\n\tdiff: {diff}\n\tpercent_diff: {percent_diff}%"
errors.append(errorstr)
elif oldval is not None:
# If only the new value is None, fail
errorstr = f"{oldkey}\n\tvalue is None\n\texpected: {oldval}\n\tnew: {newval}"
errors.append(errorstr)
else:
# If old and new values are both None, this is expected, so pass
LOG.info('Comparing %s and %s... both values are None.', oldval, newval)
[LOG.warning(x) for x in errors]
n_errors = len(errors)
if n_errors > 0:
summary_str = f"Worst absolute diff, {worst_diff[1]}: {worst_diff[0]}\nWorst percentage diff, {worst_percent_diff[1]}: {worst_percent_diff[0]}%"
errors.append(summary_str)
> pytest.fail("Failed to match {0} result value{1} within tolerance{1} :\n{2}".format(
n_errors, '' if n_errors == 1 else 's', '\n'.join(errors)), pytrace=True)
E Failed: Failed to match 2 result values within tolerances :
E s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_0.qa.metric.score_sd_image_sensitivity_ratio
E values differ by > a relative difference of 1e-07
E expected: 1.427446433699322
E new: 1.4275466845286737
E diff: -0.00010025082935172236
E percent_diff: -0.00702308871177153%
E s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_1.qa.metric.score_sd_image_sensitivity_ratio
E values differ by > a relative difference of 1e-07
E expected: 1.3625947414487243
E new: 1.361980167115621
E diff: 0.0006145743331031905
E percent_diff: 0.04510323681785011%
E Worst absolute diff, s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_1.qa.metric.score_sd_image_sensitivity_ratio: 0.0006145743331031905
E Worst percentage diff, s10.hsd_imaging.mg2-20170525142607-180419.field_Ori-KL.spw_1.qa.metric.score_sd_image_sensitivity_ratio: 0.04510323681785011%
tests/testing_utils.py:435: Failed