Roadmap: embeddings
ML milestone 1 fills semantic_score and retrieval; bridge-style scores follow once clusters are available.
Recommended
Papers come from a materialized ranking run (paper_scores per family). The API derives plain-language explanations from the same weights stored on the run. The undercited family only scores works in the frozen low-cite candidate pool (docs/candidate-pool-low-cite.md v0), scoped to your corpus snapshot.
ml2-5a-qual-r2-k6-20260405 | run rank-83976f1097 | snapshot source-snapshot-20260329-170012 | 15 papers
Web is filtering runs with NEXT_PUBLIC_RANKING_VERSION=ml2-5a-qual-r2-k6-20260405.
Family: undercited | order by materialized final_score descending
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.6691, diversity_penalty=0.0000
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0521, topic_growth=0.7179, diversity_penalty=0.2242
Supervised Contrastive Models for Music Information Retrieval in Classical Persian Music
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.5872, diversity_penalty=0.0000
Cross‑Modal Approaches to Beat Tracking: A Case Study on Chopin Mazurkas
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.5872, diversity_penalty=0.0000
ChoraleBricks: A Modular Multitrack Dataset for Wind Music Research
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.1042, topic_growth=0.5872, diversity_penalty=0.3554
RWC Revisited: Towards a Community-Driven MIR Corpus
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.4622, diversity_penalty=0.0000
Correction: Salsa, a Dataset for Beat Estimation in Salsa Music
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.4622, diversity_penalty=0.0000
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.1042, topic_growth=0.5025, diversity_penalty=0.3554
Smartwatch-Based Audio-Gestural Insights in Violin Bow Stroke Analyses
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.3751, diversity_penalty=0.0000
MGPHot: A Dataset of Musicological Annotations for Popular Music (1958-2022)
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.3751, diversity_penalty=0.0000
The GigaMIDI Dataset with Features for Expressive Music Performance Detection
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.3751, diversity_penalty=0.0000
The Story Behind the Real World Computing Music Database: An Interview with Masataka Goto
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.3751, diversity_penalty=0.0000
BeatNet+: Real‑Time Rhythm Analysis for Diverse Music Audio
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.0000, topic_growth=0.3751, diversity_penalty=0.0000
CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.1563, topic_growth=0.4622, diversity_penalty=0.4485
Perceptual and automated estimates of infringement in 40 music copyright cases
Low-cite candidate pool (see docs/candidate-pool-low-cite.md v0): core corpus, recency floor, citation ceiling, title+abstract gate; popularity penalty among pool members only. Semantic and bridge not yet modeled.
Signals: citation_velocity=0.4167, topic_growth=0.4622, diversity_penalty=0.9166
ML milestone 1 fills semantic_score and retrieval; bridge-style scores follow once clusters are available.
The rule-only undercited list (/api/v1/recommendations/undercited) uses the same pool definition but is not tied to a corpus snapshot. For snapshot-scoped A/B against the ranked undercited family, use Evaluation.