Scoring System
After fetching content from all sources, Horizon uses an AI model to score each item on a 0-10 scale. This determines what appears in the daily summary.
Pipeline
- Batch processing — Items are scored in batches of 10 with a progress bar. Failed items receive a score of 0.
- Content preparation — For each item, the content is truncated (800 chars if comments are present, 1000 otherwise) and engagement metrics are assembled from metadata (HN score, Reddit upvote ratio, etc.).
- AI analysis — The prepared content is sent to the configured AI model (temperature 0.3) with a system prompt defining the scoring criteria.
- Response parsing — The AI response is parsed as JSON (with fallbacks for code-block-wrapped JSON). Each item gets:
ai_score(float),ai_reason(string),ai_summary(string), andai_tags(list). - Retry — Failed AI calls are retried up to 3 times with exponential backoff (2-10 seconds).
Scoring Scale
| Score | Tier | Description |
|---|---|---|
| 9-10 | Groundbreaking | Major breakthroughs, paradigm shifts, major version releases, significant research breakthroughs |
| 7-8 | High Value | Important developments, technical deep-dives, novel approaches, insightful analysis, valuable tools |
| 5-6 | Interesting | Incremental improvements, useful tutorials, moderate community interest |
| 3-4 | Low Priority | Minor updates, common knowledge, overly promotional |
| 0-2 | Noise | Spam, off-topic, trivial updates |
Scoring Factors
The AI evaluates each item based on:
- Technical depth and novelty — original ideas, new techniques, research contributions
- Potential impact — how broadly this affects software engineering, AI/ML, or systems research
- Quality of writing/presentation — clarity, structure, thoroughness
- Community discussion — insightful comments, diverse viewpoints, substantive debates
- Engagement signals — high upvotes/favorites paired with substantive discussion (not just raw numbers)
Engagement metadata is source-specific: HN provides score and comment count, Reddit provides upvote ratio and comment count.
Filtering
After scoring, items are filtered by filtering.ai_score_threshold (default: 7.0) and sorted by score descending. Only items meeting the threshold appear in the daily summary.
{
"filtering": {
"ai_score_threshold": 7.0,
"time_window_hours": 24
}
}
Items scoring 9.0 or above are featured in the “Today’s Highlights” section of the summary.
Enrichment
Items that pass the score threshold go through a second AI pass for enrichment (src/ai/enricher.py):
- Concept extraction — AI identifies 1-3 technical concepts in the item that may need explanation.
- Web search — Each concept is searched via DuckDuckGo to gather grounding context.
- Structured analysis — The item content and search results are sent to AI, which produces:
whats_new— what specifically happened or changedwhy_it_matters— significance and impactkey_details— notable technical details or caveatsbackground— background knowledge for readers without deep domain expertise
These fields are combined into a detailed_summary stored in the item’s metadata and used in the final daily summary.