Evaluation

Evaluation v0: ranked feed vs simple baselines

Bridge familyDistributional checks only

This page calls GET /api/v1/evaluation/compare so you can inspect the same candidate pool under three orderings: materialized ranking, citation-sorted, and date-sorted. Nothing here measures whether researchers would find the papers useful; it only shows distributional checks on short lists.

Pool size

528

Run label

shadow-generalization-product-candidate-ranking-v1

Embedding

shadow-generalization-text-embedding-v1

Generated at

2026-06-26

Run label filter: shadow-generalization-product-candidate-ranking-v1

Focus paper: https://openalex.org/W4405899471 | compare limit 12 Visible in: date baseline.

Interpretation guardrails

Interpretation notes

These outputs help compare ranking behavior and expose drift; they are not expert-reviewed evidence that papers are useful to researchers.

Side-by-side lists share the same candidate pool for the selected recommendation family and corpus snapshot.
Recency, citation, and topic summaries are coarse proxies over the short lists shown; they do not measure whether a researcher would find a paper useful.
Topic overlap uses Jaccard similarity on topic labels attached to papers in this corpus, not semantic similarity of full text.
Use ranked outputs for product behavior; use this endpoint to sanity-check drift against naive orderings.

Run context

Run and pool

Run shadow-generalization-product-candidate-ranking-v1 | rank-83787b91ef | snapshot source-snapshot-shadow-generalization-v1-20260521 | embedding shadow-generalization-text-embedding-v1 | pool size 528

All included works in the corpus snapshot (same candidate set as the ranking run's emerging/bridge families).

Topic labels are imported metadata and can be noisy; use them as coarse navigation hints, not authoritative classifications.

Bridge family

Bridge distinctness diagnostics

Diagnostic only.
Not a researcher-reviewed usefulness benchmark.
Used to inspect whether this bridge review arm is worth further evaluation.
Use these diagnostics to decide whether the bridge experiment deserves further evaluation.

Pinned to the same ranking run as the compare response above. Overlap and coverage are structural checks on short heads, not measures of paper usefulness.

Head overlap (Jaccard)

Full bridge vs eligible-only bridge: overlap 0; Jaccard 0.0000
Full bridge vs emerging: overlap 7; Jaccard 0.4118
Eligible-only bridge vs emerging: overlap 0; Jaccard 0.0000

Eligibility (bridge rows)

True / false / null: 0 / 0 / 528

Score and signal coverage

Bridge score (non-null / null): 0 / 528
Bridge signal payload (present / missing): 0 / 528
Bridge family row count: 528

Decision support (heuristic)

Suggested next step: insufficient bridge signal coverage
Eligible head differs from full bridge: Yes
Eligible head is less emerging-like than full bridge: Yes

Provenance

Active ranking run: rank-83787b91ef
Active ranking version: shadow-generalization-product-candidate-ranking-v1
Corpus snapshot: source-snapshot-shadow-generalization-v1-20260521
Embedding version: shadow-generalization-text-embedding-v1
Cluster version: not recorded
Head size: 12
Generated at: 2026-06-26T21:31:38.242985Z

List overlap

Topic label overlap between lists

Jaccard index on the set of OpenAlex topic labels appearing in the top tags of each paper in the list. High overlap means similar topic mix, not similar intellectual content.

Ranked vs citation baseline: 0.2174
Ranked vs date baseline: 0.1053
Citation vs date baseline: 0.1818

Ranked (family)

List size 12

Focus paper: https://openalex.org/W4405899471 is not visible in this arm.

Materialized ranking run: order by final_score descending, then work_id (stable tie-break). Blend and signals follow this run's persisted family_weights and paper_scores (semantic may be used for Emerging when configured).

Order: final_score DESC, work_id ASC

Mean year

2025.0

Median cites

2.5

Unique topics

Proxy stats (list-only; not relevance)

Recency: mean year 2025.00; min-max 2024-2026; share in latest two years 91.7%
Citations: mean 2.75; median 2.50; range 0-6
Topic mix: 18 unique labels in list; top: Music and Audio Processing, Music Technology and Sound Studies, Diverse Musicological Studies, Spacecraft and Cryogenic Technologies, Fluid Dynamics Simulations and Interactions

Physical Modeling of a Spring Reverb Tank Incorporating Helix Angle, Damping, and Magnetic Bead Coupling
0.671
2025 | cites: 1 | jaes
Spacecraft and Cryogenic TechnologiesFluid Dynamics Simulations and InteractionsOil and Gas Production Techniques
Open dossier Topic momentum
CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research
0.663
2025 | cites: 6 | tismir
Music and Audio ProcessingDiverse Musicological StudiesMusic Technology and Sound Studies
Open dossier Topic momentum
Modeling Time-Variant Responses of Optical Compressors With Selective State Space Models
0.646
2025 | cites: 3 | jaes
Extremum Seeking Control SystemsReal-time simulation and control systems
Open dossier Topic momentum
Toward an Improved Auditory Model for Predicting Binaural Coloration
0.643
2025 | cites: 6 | jaes
Color Science and Applications
Open dossier Topic momentum
Towards an 'Everything Corpus': A Framework and Guidelines for the Curation of More Comprehensive Multimodal Music Data
0.630
2025 | cites: 2 | tismir
Music and Audio ProcessingDiverse Musicological StudiesNatural Language Processing Techniques
Open dossier Topic momentum
ChoraleBricks: A Modular Multitrack Dataset for Wind Music Research
0.624
2025 | cites: 3 | tismir
Music and Audio ProcessingMusic Technology and Sound StudiesAnimal Vocal Communication and Behavior
Open dossier Topic momentum
Inferring Communities of Medieval Music Manuscripts Using Stochastic Block Models
0.612
2026 | cites: 0 | tismir
Authorship Attribution and ProfilingDigital Humanities and ScholarshipMusic and Audio Processing
Open dossier Topic momentum
Beyond a Western Center of Music Information Retrieval: A Bibliometric Analysis of the First 25 Years of ISMIR Authorship
0.609
2025 | cites: 1 | tismir
Music and Audio ProcessingDiverse Musicological StudiesInformation Retrieval and Search Behavior
Open dossier Topic momentum
Analysis of Various 3D Acquisition Techniques and Mesh Differences for Head-related Transfer Functions Calculation
0.604
2025 | cites: 1 | jaes
Engineering Applied ResearchSimulation and Modeling Applications
Open dossier Topic momentum
The AI Music Arms Race: On the Detection of AI-Generated Music
0.603
2025 | cites: 2 | tismir
Music and Audio ProcessingMusic Technology and Sound StudiesTime Series Analysis and Forecasting
Open dossier Topic momentum
BPSD: A Coherent Multi-Version Dataset for Analyzing the First Movements of Beethoven's Piano Sonatas
0.600
2024 | cites: 5 | tismir
Music and Audio ProcessingNeuroscience and Music PerceptionMusic Technology and Sound Studies
Open dossier Topic momentum
STAR Drums: A Dataset for Automatic Drum Transcription
0.600
2025 | cites: 3 | tismir
Music and Audio ProcessingMusic Technology and Sound StudiesDiverse Musicological Studies
Open dossier Topic momentum

Citation baseline

List size 12

Focus paper: https://openalex.org/W4405899471 is not visible in this arm.

Popularity-style baseline on the same pool: highest citations first (not a relevance judgment).

Order: citation_count DESC, year DESC, openalex_id ASC

Mean year

2024.7

Median cites

6.0

Unique topics

Proxy stats (list-only; not relevance)

Recency: mean year 2024.67; min-max 2024-2025; share in latest two years 100.0%
Citations: mean 8.50; median 6.00; range 4-25
Topic mix: 10 unique labels in list; top: Music Technology and Sound Studies, Music and Audio Processing, Hearing Loss and Rehabilitation, Tactile and Sensory Interactions, Image and Video Quality Assessment

Digital Technology in Cultural Heritage: Construction and Evaluation Methods of AI-Based Ethnic Music Dataset
2024 | cites: 25 | -
Open dossier Topic momentum
Issues and Challenges of Audio Technologies for the Musical Metaverse
2025 | cites: 11 | jaes
Music Technology and Sound Studies
Open dossier Topic momentum
Testing Auditory Illusions in Augmented Reality: Plausibility, Transfer-Plausibility, and Authenticity
2024 | cites: 11 | jaes
Hearing Loss and RehabilitationTactile and Sensory InteractionsImage and Video Quality Assessment
Open dossier Topic momentum
FakeMusicCaps: A Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models
2025 | cites: 10 | -
Open dossier Topic momentum
Audiovisual Congruence and Localization Performance in Virtual Reality: 3D Loudspeaker Model vs. Human Avatar
2024 | cites: 7 | jaes
3D Surveying and Cultural Heritage
Open dossier Topic momentum
CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research
2025 | cites: 6 | tismir
Music and Audio ProcessingDiverse Musicological StudiesMusic Technology and Sound Studies
Open dossier Topic momentum
Toward an Improved Auditory Model for Predicting Binaural Coloration
2025 | cites: 6 | jaes
Color Science and Applications
Open dossier Topic momentum
Advancing deep learning for expressive music composition and performance modeling
2025 | cites: 6 | -
Open dossier Topic momentum
A HYBRID CNN-LSTM DEEP LEARNING MODEL FOR INTRUSION DETECTION IN SMART GRID
2025 | cites: 6 | -
Open dossier Topic momentum
PESTO: Real‑Time Pitch Estimation with Self‑Supervised Transposition‑Equivariant Objective
2025 | cites: 5 | tismir
Music and Audio ProcessingSpeech and Audio ProcessingMusic Technology and Sound Studies
Open dossier Topic momentum
BPSD: A Coherent Multi-Version Dataset for Analyzing the First Movements of Beethoven's Piano Sonatas
2024 | cites: 5 | tismir
Music and Audio ProcessingNeuroscience and Music PerceptionMusic Technology and Sound Studies
Open dossier Topic momentum
PAGURI: A User Experience Study of Creative Interaction with Text-to-Music Models
2025 | cites: 4 | -
Open dossier Topic momentum

Date baseline

List size 12

Focus paper: https://openalex.org/W4405899471 appears in this arm.

Pure recency baseline on the same pool: newest year first (not a relevance judgment).

Order: year DESC, openalex_id ASC

Mean year

2026.0

Median cites

0.0

Unique topics

Proxy stats (list-only; not relevance)

Recency: mean year 2026.00; min-max 2026-2026; share in latest two years 100.0%
Citations: mean 0.08; median 0.00; range 0-1
Topic mix: 3 unique labels in list; top: Music and Audio Processing, Music Technology and Sound Studies, Image Processing and 3D Reconstruction

Beyond Acoustics: Capacity Limitations of Linguistic Levels
2026 | cites: 1 | -
Open dossier Topic momentum
Regularized Autoregressive Modeling and Its Application to Audio Signal Reconstruction
2026 | cites: 0 | -
Open dossier Topic momentum
The Sound of Water: Inferring Physical Properties from Pouring Liquids
2026 | cites: 0 | -
Open dossier Topic momentum
Relevance-Guided Audio Visual Fusion for Video Saliency Prediction
2026 | cites: 0 | -
Open dossier Topic momentum
Explainable detection of machine generated music and early systematic evaluation
2026 | cites: 0 | -
Open dossier Topic momentum
M6: multi-generator, multi-domain, multi-lingual and cultural, multi-genres, multi-instrument machine-generated music detection databases
2026 | cites: 0 | -
Focus paper
Open dossier Topic momentum
Separate this, and all of these Things Around It: Music Source Separation Via Hyperellipsoidal Queries
2026 | cites: 0 | -
Open dossier Topic momentum
SingMOS-Pro: An Comprehensive Benchmark For Singing Quality Assessment
2026 | cites: 0 | -
Open dossier Topic momentum
Methods for Pitch Analysis in Contemporary Popular Music: Multiple Pitches From Harmonic Tones in Vitalic's Music
2026 | cites: 0 | jaes
Open dossier Topic momentum
Deepaq: A Perceptual Audio Quality Metric Based on Foundational Models and Weakly Supervised Learning
2026 | cites: 0 | -
Open dossier Topic momentum
BACHI: Boundary-Aware Symbolic Chord Recognition Through Masked Iterative Decoding on POP and Classical Music
2026 | cites: 0 | -
Open dossier Topic momentum
A Lightweight Two‑Branch Architecture for Multi‑Instrument Transcription via Note‑Level Contrastive Clustering
2026 | cites: 0 | tismir
Music and Audio ProcessingMusic Technology and Sound StudiesImage Processing and 3D Reconstruction
Open dossier Topic momentum

Generated at 2026-06-26T21:31:38.162408Z. This page shows citation and date baselines plus distributional checks on short lists. For roadmap-style framing, see /api/v1/evaluation/summary.