Last Updated: 2026-03-29 17:15 PT
Analysis of 13,090 wiki pages across all sections reveals the following quality metrics:
| Section |
Pages |
Avg Words |
Avg Refs |
Avg Links |
Quality Score |
| brain-regions |
50 |
1,827 |
5.2 |
46 |
33.3 |
| mechanisms |
1,194 |
2,513 |
13.2 |
48 |
38.2 |
| diseases |
402 |
2,396 |
10.0 |
27 |
38.9 |
| biomarkers |
142 |
1,517 |
2.2 |
15 |
28.4 |
| therapeutics |
603 |
1,808 |
9.1 |
30 |
25.7 |
| cell-types |
3,565 |
1,534 |
4.6 |
20 |
20.5 |
| genes |
3,793 |
696 |
3.9 |
— |
15.7 |
| proteins |
3,174 |
1,626 |
5.6 |
24 |
18.4 |
| researchers |
188 |
715 |
0.0 |
4 |
17.3 |
| companies |
335 |
829 |
0.0 |
14 |
12.6 |
| clinical-trials |
184 |
520 |
0.2 |
8 |
13.8 |
Key findings (2026-03-29):
- Core sections (mechanisms, diseases, brain-regions) maintain highest quality scores
- Reference coverage improved significantly — YAML frontmatter refs with hyperlinked PMIDs/DOIs now standard
- Cross-linking is comprehensive: 70,185 internal links across 3,146 files (avg 22.3 per file)
- Empty links: 0 across all sections
- No conflict markers in any content files
Earlier analysis showed similar patterns but with lower reference coverage:
| Section |
Pages |
Avg Words |
Avg Refs |
Ref Coverage |
| researchers |
188 |
715 |
0.0 |
0% |
| companies |
335 |
829 |
0.0 |
2% |
| institutions |
283 |
984 |
0.2 |
4% |
| clinical-trials |
184 |
520 |
0.2 |
6% |
| treatments |
25 |
1,213 |
0.2 |
12% |
| cell-types |
3,563 |
1,534 |
4.6 |
20% |
| biomarkers |
134 |
1,517 |
2.2 |
26% |
| proteins |
3,175 |
1,626 |
5.6 |
34% |
| brain-regions |
50 |
1,827 |
5.2 |
40% |
| therapeutics |
603 |
1,808 |
9.1 |
54% |
| diseases |
400 |
2,396 |
10.0 |
54% |
| mechanisms |
1,145 |
2,513 |
13.2 |
76% |
Weakest section (2026-03-22): companies (Quality Score: 12.6) with 44% of pages under 500 words
Quality score (0-100) is calculated as:
- 40% weight: Normalized word count (2,500 words = max)
- 35% weight: Normalized reference count (30 refs = max)
- 25% weight: Normalized link count (50 links = max)
Higher scores indicate better overall page quality.
- Core sections maintain quality — mechanisms and diseases consistently score highest
- Link density is no longer 0% — entity linker has run extensively across the corpus
- Reference coverage has improved — YAML frontmatter refs with hyperlinked sources are now standard
- No broken links — 0 empty internal or external links across all sections
- Clean conflict markers — 0 unresolved merge conflicts in content files
- Fixed 295+ YAML frontmatter files with bracket corruption, double-quoted titles, and malformed refs
- Ran entity linker across full corpus — 70,185 links verified
- Removed duplicate sections from 200+ mechanism pages
- Converted orphan redirect stubs to proper
state:redirect format
- Fixed Mermaid diagram syntax (proper node labels, no unicode arrows)
- Replaced bare PMIDs with hyperlinked YAML refs across all major pages