Machine Learning: Elementary Introduction to the Wolfram Language

Explore the latest version of An Elementary Introduction to the Wolfram Language »

22	Machine Learning

So far in this book, when we’ve wanted the Wolfram Language to do something, we’ve written code to tell it exactly what to do. But the Wolfram Language is also set up to be able to learn what to do just by looking at examples, using the idea of machine learning.

We’ll talk about how to train the language yourself. But first let’s look at some built-in functions that have already been trained on huge numbers of examples.

LanguageIdentify takes pieces of text, and identifies what human language they’re in.

Identify the language each phrase is in:

In[1]:=

Out[1]=

The Wolfram Language can also do the considerably more difficult “artificial intelligence” task of identifying what an image is of.

Identify what an image is of:

In[2]:=

Out[2]=

There’s a general function Classify, which has been taught various kinds of classification. One example is classifying the “sentiment” of text.

Upbeat text is classified as having positive sentiment:

In[3]:=

Out[3]=

Downbeat text is classified as having negative sentiment:

In[4]:=

Out[4]=

You can also train Classify yourself. Here’s a simple example of classifying handwritten digits as 0 or 1. You give Classify a collection of training examples, followed by a particular handwritten digit. Then it’ll tell you whether the digit you give is a 0 or 1.

With training examples, Classify correctly identifies a handwritten 0:

In[5]:=

Out[5]=

To get some sense of how this works—and because it’s useful in its own right—let’s talk about the function Nearest, that finds what element in a list is nearest to what you supply.

Find what number in the list is nearest to 22:

In[6]:=

Out[6]=

Find the nearest three numbers:

In[7]:=

Out[7]=

Nearest can find nearest colors as well.

Find the 3 colors in the list that are nearest to the color you give:

In[8]:=

Out[8]=

It also works on words.

Find the 10 words nearest to “good” in the list of words:

In[9]:=

Out[9]=

There’s a notion of nearness for images too. And though it’s far from the whole story, this is effectively part of what ImageIdentify is using.

Something that’s again related is recognizing text (optical character recognition or OCR). Let’s make a piece of text that’s blurred.

Create an image of the word “hello”, then blur it:

In[10]:=

Out[10]=

TextRecognize can still recognize the original text string in this.

Recognize text in the image:

In[11]:=

Out[11]=

If the text gets too blurred TextRecognize can’t tell what it says—and you probably can’t either.

Generate a sequence of progressively more blurred pieces of text:

In[12]:=

Out[12]=

As the text gets more blurred, TextRecognize makes a mistake, then gives up altogether:

In[13]:=

Out[13]=

Something similar happens if we progressively blur the picture of a cheetah. When the picture is still fairly sharp, ImageIdentify will correctly identify it as a cheetah. But when it gets too blurred ImageIdentify starts thinking it’s more likely to be a lion, and eventually the best guess is that it’s a picture of a person.

Progressively blur a picture of a cheetah:

In[14]:=

Out[14]=

When the picture gets too blurred, ImageIdentify no longer thinks it’s a cheetah:

In[15]:=

Out[15]=

ImageIdentify normally just gives what it thinks is the most likely identification. You can tell it, though, to give a list of possible identifications, starting from the most likely. Here are the top 10 possible identifications, in all categories.

ImageIdentify thinks this might be a cheetah, but it’s more likely to be a lion, or it could be a dog:

In[16]:=

Out[16]=

When the image is sufficiently blurred, ImageIdentify can have wild ideas about what it might be:

In[17]:=

Out[17]=

In machine learning, one often gives training that explicitly says, for example, “this is a cheetah”, “this is a lion”. But one also often just wants to automatically pick out categories of things without any specific training.

One way to start doing this is to take a collection of things—say colors—and then to find clusters of similar ones. This can be achieved using FindClusters.

Collect “clusters” of similar colors into separate lists:

In[18]:=

Out[18]=

You can get a different view by connecting each color to the three most similar colors in the list, then making a graph out of the connections. In the particular example here, there end up being three disconnected subgraphs.

Create a graph of connections based on nearness in “color space”:

In[19]:=

Out[19]=

A dendrogram is a tree-like plot that lets you see a whole hierarchy of what’s near what.

Show nearby colors successively grouped together:

In[20]:=

Out[20]=

When we compare things—whether they’re colors or pictures of animals—we can think of identifying certain features that allow us to distinguish them. For colors, a feature might be how light the color is, or how much red it contains. For pictures of animals, a feature might be how furry the animal looks, or how pointy its ears are.

In the Wolfram Language, FeatureSpacePlot takes collections of objects and tries to find what it considers the “best” distinguishing features of them, then uses the values of these to position objects in a plot.

FeatureSpacePlot doesn’t explicitly say what features it’s using—and actually they’re usually quite hard to describe. But what happens in the end is that FeatureSpacePlot arranges things so that objects that have similar features are drawn nearby.

FeatureSpacePlot makes similar colors be placed nearby:

In[21]:=

Out[21]=

If one uses, say, 100 colors picked completely at random, then FeatureSpacePlot will again place colors it considers similar nearby.

100 random colors laid out by FeatureSpacePlot:

In[22]:=

Out[22]=

Let’s try the same kind of thing with images of letters.

Make a rasterized image of each letter in the alphabet:

In[23]:=

Out[23]=

FeatureSpacePlot will use visual features of these images to lay them out. The result is that letters that look similar—like y and v or e and c—will wind up nearby.

In[24]:=

Out[24]=

Here’s the same thing, but now with pictures of cats, cars and chairs. FeatureSpacePlot immediately separates the different kinds of things.

FeatureSpacePlot places photographs of different kinds of things quite far apart:

In[25]:=

Out[25]=

Vocabulary

LanguageIdentify[text]		identify what human language text is in
ImageIdentify[image]		identify what an image is of
TextRecognize[text]		recognize text from an image (OCR)
Classify[training,data]		classify data on the basis of training examples
Nearest[list,item]		find what element of list is nearest to item
FindClusters[list]		find clusters of similar items
NearestNeighborGraph[list,n]		connect elements of list to their n nearest neighbors
Dendrogram[list]		make a hierarchical tree of relations between items
FeatureSpacePlot[list]		plot elements of list in an inferred “feature space”

Exercises

Check your answers in the Wolfram Cloud

22.1Identify what language the word “ajatella” comes from. »

Expected output:

Out[]=