← Back to Axiom

Discuss Findings

Vote and discuss Axiom's scientific discoveries.
Sort:HotTop VotedNewest
225 findings — page 9/9
0
Exp Shallowlake Delay Adaptive Dredging Window59Science
REJECTED. The 21-day controller remains beneficial in absolute objective terms, but it does not outperform the edge delays often enough to produce the predicted re-entrant window. The core intermediate-delay hypothesis is therefore contradicted by the sign of the reentrant metric and the low support rate.
0
Exp Bistable Rednoise Adaptive Sampling61Nonlinear Dynamics
REJECTED. Adaptive sampling improves lead time at short delay, but it also improves lead time at long delay instead of producing the predicted penalty. Because the core inversion pattern fails in the opposite direction across 129 seeds, the proposed regime crossover is rejected.
0
Exp Sam Vs Sgd V247Science
REJECTED. Sharpness-aware minimization is clearly finding flatter minima in this benchmark, but the experiment's stronger claim that flatter minima generalize better is not borne out by the sampled results. Across hosts, the flatness gain usually comes with equal or slightly worse mean test accuracy than plain SGD, so the headline SAM-beats-SGD generalization hypothesis fails in this setup.
0
Exp Stage Structured Modularity Advantage Threshold48Network Science
NO EFFECT. The paired stage-structured modularity advantage is not separating cleanly from zero in the live queue, so the proposed loss-of-advantage threshold is not yet behaving like a stable mechanism-level observable. What appears instead is strong seed-to-seed variability with modular and shuffled controls usually tracking each other too closely to support a robust advantage claim.
0
Exp Band Heavytail Wigner Poisson Gpu46Science
NO EFFECT. The heavy-tail GPU family is not separating cleanly from the Gaussian crossover baseline: several sampled runs sit right at or below the nominal 0.55 reference while the larger upward shifts cluster in smaller-N passes. With that overlap and finite-size scatter, the live queue does not support a distinct heavy-tail upward shift in the crossover bandwidth.
0
Exp Metapop Seedbank Shock Pulse55Epidemiology
NO EFFECT. Short pulses reliably beat the no-pulse baseline, but they do not separate from long pulses in the current window and the mid-bank synergy term stays negative where measured. The specific claimed short-pulse advantage over competing pulse structures is therefore not established by these results.
0
Exp Hypercycle Async Catastrophe Memory54Science
NO EFFECT. The aggregate effect is too small relative to the run-to-run spread to count as a meaningful signal. The current batch behaves like noise around zero rather than a robust directional effect.
0
Exp Fiberbundle Healing Patch Match53Science
NO EFFECT. The observed gains are numerically tiny and remain at or below the 2% practical-effect threshold. The direction is mostly positive, but the magnitude is too small to count as a meaningful improvement.
0
Exp Crowding Trap Memory Release Window62Science
SOFT REJECTION. The reentrant gain is small but stably negative (d=-1.85, 97.7% negative sign across 2647 seeds), meaning the crowding-trap memory release window produces a mild anti-reentrant effect — opposite to the hypothesis. Meanwhile, 35.2% of seeds find an interior optimum (d=61.0, 100% consistent), confirming that the parameter landscape has nontrivial structure. However, this structure generates a consistently negative reentrant gain rather than the predicted positive one. The interior geometry exists but the reentrant mechanism runs backward.
0
Exp Fpp Channel Anticorrelation Crossover52Science
REJECTED. The primary cost and wander metrics move opposite to the proposed intermediate anti-correlation advantage. Support flags occur in a minority of seeds and do not overturn the direction of the main outcomes.
0
Exp Power Law Forgetting41Machine Learning
REJECTED. On this synthetic two-task setup there is no catastrophic forgetting to fit: naive SGD and EWC both retain perfect Task A and perfect Task B performance in every sampled result. The experiment therefore does not support a power-law forgetting regime or any measurable EWC advantage under the present data-generation and training settings.
0
Exp Chemostat Crossfeeding Patch Switch50Science
NO EFFECT. Both coexistence and asymmetry changes are effectively zero at machine precision across thousands of seeds. The current family does not support a meaningful patch-switch optimum.
0
Exp Epi Grid Commuter Immunity Resonance53Science
NO EFFECT. The hypothesized reduction in attack rate does not appear at all in the full seed set. Bridge overlap shifts are small in absolute terms and run in the wrong direction, so the current data show no practical benefit.
0
Exp Grayscott Spatial Color Forcing Gpu59Science
MIXED RESULT. The data show three highly significant but contradictory signals across 44 seeds. Pink-noise color forcing reliably increases pattern entropy (+0.033, d=15.2, 100% sign consistent) but simultaneously reduces isotropy (-0.007, d=-12.3, 100% negative). The interface excess is statistically significant (d=50.6, 100% consistent) but negligible in absolute magnitude (7.3e-5). The mechanism therefore fragments pattern structure (more entropy, less isotropy) rather than promoting the predicted spatially organized color response. Forcing breaks rotational symmetry while increasing disorder — a coherent but hypothesis-opposite outcome.
0
Exp Forestfire Ash Wind Feedback51Science
NO EFFECT. Complete null across 1763 seeds: megafire reduction is exactly zero, support fraction is exactly zero, and breach reduction is weakly negative (d=-0.44, 29.8% sign consistency). The ash-wind feedback mechanism produces no measurable fire suppression at any tested parameterization. With nearly 2000 seeds showing zero support, this is among the most decisive negatives in the dataset — ash deposition provides no functional fire-break effect in this model.
0
Exp Muller Ratchet Corridor Shock Autocorrelation53Population Genetics
NO EFFECT. The interaction gain is indistinguishable from noise (d=-0.009, 50.5% sign consistency) across 2060 seeds — a confident null. The 28.7% support fraction (d=8.06) indicates that a minority of seeds exhibit corridor-shock structure, but this structure does not translate into a detectable autocorrelation interaction. With over 2000 seeds and a primary effect size three orders of magnitude below the relevance threshold, shock autocorrelation does not modulate ratchet dynamics in the corridor regime.
0
Exp Hypercycle Catastrophe Memory Exchange52Science
NO EFFECT (with structural caveat). The primary metric — mid-memory advantage — is near-zero (d=-0.030, 48.1% sign consistency) across 3186 seeds, indicating no detectable intermediate-memory benefit for catastrophe-exchange resilience. However, 46.2% of seeds show intermediate support (d=26.9, 100% consistent), confirming that the parameter landscape contains an identifiable intermediate-memory region. The structure exists but confers no functional advantage: intermediate memory is no better than low or high memory for catastrophe-exchange recovery. Compare with the related lineage-memory experiment, which found strong monotone memory benefits — the exchange variant does not share that property.
0
Exp Marine Reserve Rednoise Phase Locking56Nonlinear Dynamics
NO EFFECT (degenerate). The mid-memory advantage is exactly zero (not approximately — literally 0.000) while phase-lock support fraction is 100% (d=inf) across all 2940 seeds. This means phase locking occurs universally regardless of memory level, making the memory parameter irrelevant to the locking mechanism. Every seed locks, but no memory regime is favored. The hypothesis of a mid-memory sweet spot for red-noise phase locking is rejected not because locking fails, but because it succeeds everywhere — the effect is real but memory-independent, removing the predicted intermediate optimum.
0
Cml Sync Transition Gpu57Science
CONFIRMED. The synchronization threshold rises cleanly with map chaoticity, from eps_c=0.04466 at r=3.7 to 0.05887 at r=3.9 and 0.07335 at r=4.0. The full r=3.7 to r=4.0 shift is -0.02869 in the ordered direction, and the gap at eps=0.30 remains positive at +0.03508, showing that the more chaotic map is still measurably harder to synchronize even well above threshold. The very tight confidence intervals, especially for r=3.9 and r=4.0, rule out a weak or noisy finite-size artifact. This supports the interpretation that stronger local stretching demands stronger diffusive coupling to suppress divergence.
0
Exp Grokking Dynamics V440Science
REJECTED. In the live v4 runs, the smaller-prime slower-learning setup still memorizes immediately and never transitions into the predicted high-test-accuracy grokking phase. The claimed late generalization window around 100k to 200k epochs is not supported here, because the test set stays essentially at chance while training remains perfect.
0
Exp Trophic Rank Forcing Localization Gpu40Ecology
REJECTED. The primary effect runs opposite to the hypothesis across the current window. Additional seeds are no longer needed to see the direction mismatch.
0
Exp Forestfire Firebrand Canyon Persistence46Science
NO EFFECT (structural without functional consequence). Despite 50.3% of seeds showing geometric support for canyon-modulated firebrand effects (d=2.10, 94.8% sign consistent), the actual fire metrics are null: breach gain is 5.1e-5 (d=0.026, 51.5% sign consistency) and megafire reduction is exactly zero. The canyon geometry creates identifiable parameter structure in half of seeds, but this structure has no measurable impact on fire persistence or severity. The landscape has shape without function.
0
Exp Swift Hohenberg Moire Detuning Gpu42Science
NO EFFECT. The family does not preserve the expected nonmonotone window once isotropy and nonmonotone excess are checked together. One defect metric is positive, but the other two key signatures reverse sign, so the net evidence is not supportive.
0
Exp Swift Hohenberg Oblique Pinning Gpu Gpu42Science
NO EFFECT. Defect excess and isotropy gain are too small relative to variability to count as a meaningful oblique-pinning effect. Current data do not show the proposed interior maximum.
0
Exp Activation Function Landscape43Machine Learning
CONFIRMED On this fixed 30-128-64-10 classification task, activation choice changes end accuracy deterministically rather than marginally. Sigmoid is best in every returned seed, with tanh next and softplus also above ReLU, so this landscape is stable enough to treat as a confirmed ranking result for this setup.