Find the Distribution of English Words' Frequency: New in Wolfram Language 11

Find the Distribution of English Words' Frequency

Count the occurrence of words in the US Constitution.

In[1]:=

text = ExampleData[{"Text", "USConstitution"}, "Words"];
wordCount = Values[Counts[text]];

Find a simple distribution for the word counts.

In[2]:=

e\[ScriptCapitalD] = FindDistribution[wordCount]

Out[2]=

Compare the found distribution with the word counts.

show complete Wolfram Language input

In[3]:=

Show[Histogram[wordCount, {0.5, 15.5, 1}, "ProbabilityDensity", 
  PlotLabel -> "Word Count Distribution"], 
 DiscretePlot[PDF[e\[ScriptCapitalD], x], {x, 1, 15}, 
  PlotStyle -> PointSize[Large], 
  PlotLegends -> {"e\[ScriptCapitalD]"}]]

Out[3]=

Wolfram Mathematica

Find the Distribution of English Words' Frequency

Related Examples