This section audits numerical/empirical consistency: reported metrics, experimental design, baseline comparisons, statistical evidence, leakage risks, and reproducibility.
Arithmetic and consistency checks for dataset dimensions, spacing, time indexing, and several narrative metrics largely pass. However, $4$ of $5$ null-space “cancellation” coefficient-sum demonstrations fail the stated near-zero behavior by large margins, suggesting sign/transcription/grouping errors in those specific illustrative coefficient lists.
### Checked items
- ✔ C1_grid_points_cube (Page 1 Abstract; Page 2 §2.1 Dataset)
- Claim: Dataset is on a "$128^3$ periodic grid" / "grid of size $128^3$ points".
- Checks: integer_power_consistency
- Verdict: PASS
- Notes: $128^3 = 2,097,152$ exactly.
- ✔ C2_spatial_resolution_from_L_over_N (Page 2 §2.1 Dataset)
- Claim: With box size $L = 1$ and grid size $128^3$, spatial resolution is stated as $\Delta x = \Delta y = \Delta z = 1/128$.
- Checks: unit_consistent_recompute
- Verdict: PASS
- Notes: $L/N = 1/128 = 0.0078125$.
- ✔ C3_dataset_shape_product (Page 2 §2.1 Dataset)
- Claim: Full dataset shape is $(10, 4, 128, 128, 128)$.
- Checks: shape_size_recompute
- Verdict: PASS
- Notes: Total scalar values $= 10\times4\times128\times128\times128 = 83,886,080$.
- ✔ C4_time_step_and_slices_implied_span (Page 2 §2.1 Dataset; Page 4 Figure 1 caption ($t = 0, 4, 9$))
- Claim: There are $10$ time slices uniformly spaced with $\Delta t = 1$; representative times include $t = 0, 4, 9$.
- Checks: sequence_consistency
- Verdict: PASS
- Notes: Span $(10-1)\times1 = 9$ matches $t_\text{max}-t_\text{min}$; $4$ is within $[0,9]$.
- ✔ C5_central_difference_excluded_points_count (Page 3 §2.4 Temporal derivatives)
- Claim: Temporal derivatives use central difference; "The first and last time points were excluded".
- Checks: count_recompute
- Verdict: PASS
- Notes: Timepoints with $\partial_t$ defined: $10-2 = 8$.
- ✔ C6_total_training_targets_points (Page 3 §2.4 Temporal derivatives; Page 2 §2.1 Dataset)
- Claim: Each candidate term evaluated at every spatial-temporal point where $\partial_t u$ was computed; with $128^3$ grid and excluding first/last times.
- Checks: rows_count_recompute
- Verdict: PASS
- Notes: Rows per variable: $2,097,152\times8 = 16,777,216$; all $4$ vars: $67,108,864$.
- ✖ C7_advection_cancellation_sum (Page 5 §3.2.1 The null-space phenomenon (Advection Cancellation bullet))
- Claim: Advection Cancellation: adv_$v_z$ coefficient $+145.7538$ and parts $-145.7541$($v_x\partial_x v_z$), $-145.7541$($v_y\partial_y v_z$), $-145.7471$($v_z\partial_z v_z$) "resulting in a sum near zero".
- Checks: linear_combination_near_zero
- Verdict: FAIL
- Notes: Residual $= 145.7538 -145.7541 -145.7541 -145.7471 = -291.5015$ (not near $0$).
- ✖ C8_laplacian_cancellation_sum (Page 5 §3.2.1 The null-space phenomenon (Laplacian Cancellation bullet))
- Claim: Laplacian Cancellation: $\nabla^2\rho$ coefficient $+85.83159$ and diagonal second derivatives $-85.83160$($\partial_x^2\rho$), $-85.83158$($\partial_y^2\rho$), $-85.83158$($\partial_z^2\rho$) "perfectly canceling out".
- Checks: linear_combination_near_zero
- Verdict: FAIL
- Notes: Residual $= 85.83159 -85.83160 -85.83158 -85.83158 = -171.66317$ (not near $0$).
- ✖ C9_curl_cancellation_sum (Page 5 §3.2.1 The null-space phenomenon (Curl Cancellation bullet))
- Claim: Curl Cancellation: curl$_x$ weighted $-45.127$; individual derivatives $+45.167$($\partial_y v_z$) and $-45.100$($\partial_z v_y$) "net contribution approximately zero".
- Checks: linear_combination_near_zero
- Verdict: FAIL
- Notes: Residual $= -45.127 +45.167 -45.100 = -45.06$ (not near $0$).
- ✖ C10_divergence_cancellation_sum (Page 5 §3.2.1 The null-space phenomenon (Divergence Cancellation bullet))
- Claim: Divergence Cancellation: $\nabla\cdot\mathbf{v}$ received $-19.468$, offset by $+19.339$($\partial_x v_x$), $+19.327$($\partial_y v_y$), $+19.347$($\partial_z v_z$).
- Checks: linear_combination_near_zero
- Verdict: FAIL
- Notes: Residual $= -19.468 +19.339 +19.327 +19.347 = +38.545$ (not near $0$).
- ✔ C11_density_constant_cancellation_sum (Page 5 §3.2.1 The null-space phenomenon (Density Constant Cancellation bullet))
- Claim: Density Constant Cancellation: $95.53(\rho) - 47.88(\rho^2) - 47.65(\text{const}) \approx 0$.
- Checks: scalar_balance_at_rho_approx_1
- Verdict: PASS
- Notes: At $\rho=1$: $95.53 -47.88 -47.65 = 0.00$.
- ✔ C12_R2_range_matches_listed_values_velocity (Page 6 §3.3.1 Local derivative accuracy; Figure 5 caption; Page 1 Abstract; Page 2 end of Introduction)
- Claim: $R^2$ values between $0.593$ and $0.732$ for velocity components; reported individual $R^2$ are $0.678$ ($v_x$), $0.732$ ($v_y$), $0.593$ ($v_z$).
- Checks: range_consistency
- Verdict: PASS
- Notes: min/max of $\{0.678,0.732,0.593\}$ equals $\{0.593,0.732\}$.
- ✔ C13_velocity_rmse_order_of_magnitude_bounds (Page 6 §3.3.1 Local derivative accuracy; Figure 5 caption)
- Claim: Velocity derivative RMSE stated to be on the order of $5\times10^{-3}$ to $7\times10^{-3}$.
- Checks: interval_parse_check
- Verdict: PASS
- Notes: Parsed bounds $0.005 < 0.007$; both are $O(1 \times 10^{-3})$.
- ✔ C14_density_rmse_scientific_notation_parse (Page 6 §3.3.1 Local derivative accuracy; Figure 5 caption)
- Claim: Density derivative RMSE reported as $1.7 \times 10^{-4}$.
- Checks: scientific_notation_parse
- Verdict: PASS
- Notes: $1.7\times10^{-4} = 0.00017$.
- ✔ C15_forward_rmse_vs_std_percentage (Page 6 §3.3.2 Forward predictive modeling)
- Claim: Forward prediction: RMSE for predicted velocity states $\approx 5\times10^{-3}$, representing roughly $2\%$ of the standard deviation of the velocity fields; earlier std dev stated around $0.23$ to $0.25$.
- Checks: percentage_recompute
- Verdict: PASS
- Notes: $0.005/0.23=0.02174$ ($2.174\%$); $0.005/0.25=0.02$ ($2.0\%$), consistent with “roughly $2\%$”.
- ✔ C16_forward_R2_thresholds_consistency (Page 6 §3.3.2 Forward predictive modeling; Page 1 Abstract; Page 2 end of Introduction; Page 7 Conclusions)
- Claim: Forward-time integration yields $R^2$ values exceeding $0.999$ for velocity fields and $0.992$ for density over a subsequent time step.
- Checks: logical_threshold_check
- Verdict: PASS
- Notes: Both values lie in $[0,1]$; $0.992 < 0.999$ is permissible given the claim structure.
- ✔ C17_active_terms_range_width (Page 5 §3.2 Equation discovery; Page 7 Conclusions)
- Claim: Number of active terms ranges from $83$ to $93$ terms per variable.
- Checks: range_width_recompute
- Verdict: PASS
- Notes: Range width $= 93-83 = 10$.
### Limitations
- Only parsed text content was available; numeric values embedded solely in plots/graphics (Figures 2, 4–6) cannot be extracted reliably without image/pixel data.
- Many performance metrics ($R^2$/RMSE) and statistical summaries are stated narratively; recomputation would require access to the underlying dataset and model outputs, which are not included in the PDF text.
- Null-space/cancellation checks can only verify arithmetic of reported coefficients, not whether the corresponding features are exactly linearly dependent in the constructed feature matrix.
## Paper Ratings
| Dimension | Score |
|-----------|:-----:|
| Overall | 5/10 █████░░░░░ |
| Soundness | 5/10 █████░░░░░ |
| Novelty | 6/10 ██████░░░░ |
| Significance | 5/10 █████░░░░░ |
| Clarity | 6/10 ██████░░░░ |
| Evidence Quality | 5/10 █████░░░░░ |
Justification: The paper presents a competent SINDy-style PDE-discovery workflow and a useful case study on null-space exploitation under feature collinearity, with appropriate spectral differentiation and clear short-horizon predictions. However, the Mathematical Consistency Audit flags a critical preprocessing inconsistency (standard scaling with a constant feature) and arithmetic mismatches in the cancellation examples, and the review raises major concerns about identifiability vs. prediction, under-specified dataset provenance, potential train/test leakage, and the narrow one-step validation. These issues limit the strength of the physical claims and the reproducibility of the results, keeping soundness and evidence quality at a moderate level despite the informative case-study insights. Overall, it is a borderline contribution that would benefit from clearer experimental design, explicit library specification, and stronger validation/ablations.