ContentGrapher
ContentGrapher
research/language-study/data
The language studyData

The data

Per language, averaged over 12 topics

The reported completeness score, the same score with the score’s name-check neutralised, the independent reviewer accuracy, and the share of concepts the analysis named in English.

LanguageReportedGap vs ENModel sawScore countedAccuracyEN-named
English0.6058%47%2.0317%
French0.41−0.1965%18%2.1469%
German0.45−0.1562%29%2.1142%
Japanese0.37−0.2368%16%2.9274%
Korean0.37−0.2368%11%2.8177%

“Model saw” is the share of concepts the analysis judged well-connected; “Score counted” is the share that survived the name-check. The gap between them is the broken step. Round-trip control (English → Japanese → English): score gap +0.01 [−0.03, 0.05]. n = 12 topics, three runs each.

Per topic, reported score by language

Each row is one page, scored in all five languages on identical content. The non-English score is lower in nearly every cell; the few exceptions (CBT, one Japanese run) show the effect is a strong tendency, not a fixed penalty.

TopicTierENFRDEJAKO
Compound interestgeneral0.640.380.590.400.40
Photosynthesisgeneral0.420.230.400.270.32
Cognitive behavioral therapygeneral0.630.660.610.540.70
Cancergeneral0.690.480.570.240.31
DNSprofessional0.640.350.390.360.43
Load balancingprofessional0.650.310.440.690.28
Kubernetesprofessional0.680.430.530.310.44
Scrumprofessional0.610.510.520.320.32
DNAexpert0.680.560.080.270.41
Bayes' theoremexpert0.470.320.470.320.35
Complementary medicineexpert0.470.300.450.460.19
Cellular respirationexpert0.630.400.340.320.34

The DNA row in German (0.08) is an extreme case of the same mechanism plus a one-off analysis hiccup on that run; the pattern is consistent across the rest. Tiers are general, professional, and expert.

← OverviewMethodology →