www.stateoftheunion.onetwothree.net Open in urlscan Pro
192.254.233.47  Public Scan

URL: https://www.stateoftheunion.onetwothree.net/
Submission: On November 16 via api from GB — Scanned from GB

Form analysis 1 forms found in the DOM

GET search/search.php

<form action="search/search.php" method="get" target="more" class="noMargin">
  <input name="query" size="34" type="text" id="field">
  <input name="submit" type="submit" id="button" value="Search">
  <input name="search" value="1" type="hidden">
</form>

Text Content

 * State of the Union
 * Essay
 * Addresses
 * Appendices
 * About

Your browser does not support the canvas tag.

JavaScript is required to view the contents of this page.


HOW TO NAVIGATE THE INTERFACE

To move between State of the Union addresses, click or drag on the graph below
the word cloud; the president who delivered each address and the date of
delivery will appear at the bottom of the screen and in the center. The current
date will appear in white and the previous one in red.

You can also use the right and left arrow keys to move one year at a time.

The words from the previous address viewed will appear in red when your mouse is
over the cloud window so you can compare them. Mouse out to make them fade into
the background.

Click on a word to view the full State of the Union address in the window to the
right; the selected word will be highlighted.

Click below the graph or on a blank area of the word cloud to view the Wikipedia
U.S. History Timeline describing events that happened in and around the year of
the address.

Words automagically move to avoid overlapping. On mouseover, a line and dot
indicate the original position (according to the data). To make the words move
more, press the up arrow.

Use the search box to mine all the State of the Union addresses for occurrences
of specific words (use quotes to search for phrases such as “United States”).

Icons below the Timeline indicate the distribution of the address:

> Spoken: the text was delivered orally.
> Written: the text was delivered as a written document.
> Day: the text was broadcast during the day.
> Night: the text was broadcast in the evening.
> Radio: the text was broadcast live on the radio.
> TV: the text was broadcast live on television.
> Web: the text was distributed on the internet.

Mouse over individual words to get more data on the word:   Frequency in text
the number of times the word appears in this address. Per 10k in text at the
current frequency, how many times the word would occur per ten thousand words.
Per 10k in corpus at the average frequency of the word in all addresses, how
many times the word would occur per ten thousand words. Document frequency the
number of documents that the word appears in (out of 217 as of 2007). Relative
frequency is a measure of how unique the usage of the word is on a scale of 100;
if one president uses a word often, that other presidents do not use so often,
it is likely to be important. Average position if 0 is the first word and 100
the last; low numbers mean the word tends to come at the beginning, high numbers
the end (numbers near 50 could mean the middle or even distribution).


[View instructions]


Email this page to a friend or  


STATE OF THE UNION: APRIL 28, 2021

President Biden will deliver his Address to a joint session of congress on April
28, 2021 (9pm EST).

Have access to the analysis as soon as its ready by signing up for the
announcement email. Use this form.

 

You can make your own word to word comparisons of usage over time with the
SotuGraph tool.

 

Note: It seems odd that contractions like the word "that's" are so prominent in
the last few graphs. The reason: Obama is the first president to ever use
contractions in these official speeches and Trump has followed his precedent.
What is the significance of this change?

State of the Union (SOTU) provides access to the corpus of all the State of the
Union addresses from 1790 to 2020. SOTU allows you to explore how specific words
gain and lose prominence over time, and to link to information on the historical
context for their use. SOTU focuses on the relationship between individual
addresses as compared to the entire collection of addresses, highlighting what
is different about the selected document. You are invited to try and understand
from this information the connection between politics and language–between the
state we are in, and the language which names it and calls it into being.


THE WORDS

SOTU maps the significant content of each State of the Union address so that
users can appreciate its key terms and their relative importance.

The horizontal axis shows the average position of a word in the document. The
vertical axis displays the word’s relative frequency, determined by comparing
how frequently the word occurs in the document to how frequently it appears
throughout the entire body of SOTU addresses (see appendix for details).

Common words (“and,” “the,” etc.) and words that occur frequently in the entire
corpus (“states”) are largely filtered out; what remains are words that are
especially characteristic of a given address. The size of the word indicates how
many times it was used in the document. Click the word to view the full text of
the address with the word highlighted. Rollover the word to get detailed
frequency data.


THE DATA

The data underneath the map of significant words shows trends in the language of
the State of the Union addresses. On the graph, white bars indicate the word
length of each address. The red dots indicate readability as measured by the
address’s Flesch-Kincaid score, which is meant to suggest the grade level in an
American school for which the text is comprehensible. The actual scores are
displayed in the bottom right corner of the interface (for more information on
Flesch-Kincaid, see the appendix).

The current corpus contains 235 documents. There are 1,797,507 words in the
corpus, and 28,273 unique words.

State of the Union is a project by Brad Borevitz at onetwothree.net
AAAAAAAA