A Frequency Analysis on Wordle (2023)

A Frequency Analysis on Wordle (1)

The game Wordle has won the heart of social media in the past few weeks. Wordle is basically a word game, where the player tries to guess a 5-letter word in 6 guesses (tries), where the player progressively receives more information about the target word. The game is created by Josh Wardle, an artist and engineer. Wordle starts when the player submits their first 5-letter word. Every time a word is submitted, feedback is provided on each letter of the submitted word, indicating if the letter exists in the target word, and if the spot matches that in the target word. Below is a screenshot of the instructions.

A Frequency Analysis on Wordle (2)

Is there a good strategy to play the game? Obviously, prior to entering the first word, the player has no information about the word and it could be one of approximately 15,000 5-letter English words. However, once the first word is submitted, the player will gain more information on letters involved in the target word, depending on the entered word. Is there a good strategy once the player starts receiving feedback? Perhaps there is one. After feedback on the first word is provided, success would depend on many factors including the players vocabulary and how they can narrow down their next guess based on the feedback. However, the choice of the first word is independent of the player’s vocabulary or language skills. That is why, we can perhaps talk about a strategy that would provide the best feedback (one with as much information as possible) after the first word is submitted. Basically, a good strategy for the first entered word would be one that tries to eliminate as many remaining letters as possible. Better yet, a good strategy for the first entered word would be one that can determine as many letters of the target word as possible with as many correct placements of those letters. In this analysis, I am trying to find a strategy, or rather a word, that can serve this purpose.

Based on this article on Wikipedia, the Webster’s Third New International Dictionary of the English Language contains 470,000 entries. However, a portion of these words are obsolete or may not fall into the category of valid single words that contain only letters (no numbers or symbols). I found a dataset of such words at this repository on Github. The file contains 370,103 English words that are single and contain only letters. After extracting only 5-letter words from this list, I was left with a list of 15,918 words. I will explore this list to hopefully gain more insight into a good strategy for the first word entered into Wordle. Perhaps unrelated to this little project, but I was curious to find the distribution of words frequency based on number of letters and the following was the result. Apparently, the frequency is unimodal with a peak at words with 9 letters. The 5-letter words constitute just approximately 4.3% of all words in this list.

A Frequency Analysis on Wordle (3)

Next, I will review two different strategies, the Vowel Strategy and the Frequency Strategy. I will show that the Frequency Strategy is a better strategy and we will pick the best word based on the Frequency Strategy.

(Video) Solving Wordle using information theory

Vowels play an import role when trying to come up with a strategy to eliminate large numbers of words each round. This is because at least one vowel exists in each syllable of the word. There are 5 vowels: A, E, I, O and U. Even though the letter Y can act as a vowel in some words, I did not consider it a vowel here. Starting the search with vowels may be a good idea because every single letter in English must have at least one vowel (well this is not 100% true, as we will find a bit later, we would be able to find 8 words without any vowels, although not bringing the merit of this strategy into question).

I started my search through my list of 5-letter words by finding the number of words with one, two, three, four and five unique vowels. For instance, the word asana has only one unique vowel and the word alibi has two. Turns out, there are 6223, 8568, 1055, 18 and 0 words with 1, 2, 3, 4 and 5 unique vowels, respectively. For example, the words adieu and auloi (plural of Aulos, an ancient Greek wind instrument), Aequi (an ancient Italian tribe) and uraei (plural of Uraeus the upright form of an Egyptian cobra) all have 4 unique vowels. Needless to say, there were no 5-letter words that consisted of only vowels.

There were also 46 5-letter words, where the letter Y acted as a vowel, e.g., in words ghyll (a ravine or narrow valley in the North of England) or Scyld (a legendary Danish king). There were also 8 words without any vowels such as crwth, which is a a type of stringed instrument.

Considering how important vowels are in the English language, a strategy based on vowels would be to use first words that contain as many unique vowels as possible. This will help us determine the existence or absence of as many vowels as possible in the target word. As mentioned above, there are no 5-letter words that consist of only vowels. However, there are 18 words that consist of 4 unique vowels. These words include: adieu, aequi, aoife, audio, aueto, auloi, aurei, avoue, heiau, kioea, louie, miaou, ouabe, ouija, oukia, ourie, ousia and uraei.

One may argue that any of these 18 words would make a good first try at Wordle. However, let’s see if any of the 5 vowels are any more/less frequent in 5-letter words. The following shows the frequency of appearance for each of the 5 vowels in 5-letter words (not counting unique appearances, i.e., for letter A, the word asana counts as 1).

A Frequency Analysis on Wordle (4)

The graph above shows that the vowel U is the least frequent of the 5 vowels. Filtering out from the list of 5-letter words with 4 unique vowels, words that contains U as a vowel, we are left with a list of just two words, Aoife (an Irish feminine given name) and Kioea (a Hawaiian bird that became extinct in the 19th century). A quick search through the list shows that the consonant K appeared in 1663 5-letter words, whereas the consonant F appeared in 1115. Therefore, this strategy would suggest the word Kioea. It is important to mention that this strategy completely ignores the placement of vowels in the word and only determines the existence or absence of them in the target word. We will see in the next section, how the Frequency Strategy outperforms the Vowels Strategy.

The previous strategy only focused on the vowels. This strategy, however will focus on all of the letters. We will evaluate the most frequently used letters in the alphabet and will also determine the most frequent placement of top most frequently used letters in 5-letter words. Based on those, we will determine the best words to be entered first into the game.

I found the frequency of occurrence of each letter in the alphabet in the 5-letter words in the dataset and sorted them from largest to smallest. The following graph shows the frequencies.

A Frequency Analysis on Wordle (5)
(Video) Letter Frequency and Guessing

In the above graph, each occurrence of a letter in a word was counted as 1. So I decided to look at the average frequency of letters per word to see if it was any different from the above. Looking at the average frequency of letters in 5-letter words, I did not see any difference in the order of letters, sorted from most commonly appearing to least commonly appearing (see below).

A Frequency Analysis on Wordle (6)

This means the top most commonly used letters in 5-letter words (in terms of total frequency as well as average frequency) were the letters A, E, S, O, R, I, L, T, etc. I decided to focus on the top six letters since the average frequency dropped significantly after the sixth letter. There are 96 words that are made up of only these letters (repetition allowed). However, if we agree that the purpose of the first letter is to eliminate as many remaining letters (or determine as many letters in the target word) as possible, perhaps we should restrict repetition of letters. If we don’t allow for repetition, the list will reduce to only 12 words. These words are: aesir, aries, arise, arose, ireos, oreas, orias, osier, raise, seora, serai and serio. Which one of these 12 words would be the best first word in Wordle?

To answer this question, I decided to look at the frequency of appearance of each of the top six letters in each spot of the 5-letter words (first letter, second letter, etc.). The result is shown below.

A Frequency Analysis on Wordle (7)

I also calculated the average frequency of the top six letters in 5-letter words to see if it shows any significant difference from the absolute frequencies but it did not turn out to be different. The average frequencies are calculated by dividing the absolute frequencies by the number of 5-letter words, in which that particular letter appears in that particular spot. The average frequency plot is presented below.

A Frequency Analysis on Wordle (8)

This shows for example, that the letter S frequently appears in 5-letter words as the fifth letter, but it is almost never appearing as the third letter. Based on this, I used a simple scoring system to assign a score to each word, which basically consists of the sum of average frequencies for the letters based on above results. This scoring system will assume that the 6 letters are all valued equally and will only focus on frequencies per spot. For example, the score for the letter aesir will be calculated as approximately 0.1619 + 0.2928 + 0.1162 + 0.2771 + 0.1840=1.032, since the average frequency of the letter A in the first spot is 0.1619, average frequency of the letter E in the second spot is 0.2928, and so on. The table and figure below show the calculated score for all 12 words in the list.

A Frequency Analysis on Wordle (9)
A Frequency Analysis on Wordle (10)
(Video) What's the Hardest Answer in Wordle?

Based on this analysis, the word Aries (Latin word for ram) has the highest calculated score. It is shown that if used as the first word entered into Wordle, on average, the word Aries can determine the largest number of letters in the target word.

A Frequency Analysis on Wordle (11)

To test the effectiveness of Aries to identify letters in the target word, I used a random selection of 5000 words from the list of 5-letter words, and calculated how many letters, on average, would be indicated when the word Aries is used as the first word on Wordle. I replicated this process 10 times. The following shows that the average number of letters (per word), whose existence in the target word identified after Aries was used as first word, was between 2.055 and 2.1. Please note, the following result does not separate letters, whose spot was correctly identified and those who weren’t. It simply includes all the letters that were identified in the target word. In other words, all the letters that turn Gold and Green after the word was entered.

A Frequency Analysis on Wordle (12)

I conducted the same analysis for the word Kioea (which was suggested by our Vowels Strategy), and the result was an average of only 1.79 letters identified. This is an indication that the Frequency Strategy was superior in indicating letters in the target word to the Vowel Strategy.

Next, I calculated the average number of letters (per word), whose actual spot in the target word was correctly identified by the word Aries. This means, not only is the letter identified, but its spot in the target word is also correctly identified. In other words, this is the average number of letters that turn Green after the word is entered. For the simulation I again used 10 replications and 5000 randomly selected words in each replication. The following shows the results for Aries.

I ran the same analysis for all the 12 words in the list of top words to see if any of them could beat Aries. As expected, the word Aries demonstrated the highest value for average number of letters (per target word), whose spots were correctly identified. For this analysis also I used 10 replications and 5000 randomly selected words in each replication and reported the average across all 10 replications.

(Video) How to View Word Count Frequency in Wordle Word Cloud

A Frequency Analysis on Wordle (14)
A Frequency Analysis on Wordle (15)

Based on the results of this study, if used as the first word, the word Aries can correctly identify the existence of approximately 2.07 letters on average and the correct spot of approximately 0.6 letters, on average, will be correctly identified.

A Frequency Analysis on Wordle (16)

I realized later that, unfortunately, Aries is not a word on Wordle’s list of accepted words, and neither are the next best words on the list Orias and Serio (based on the word scores identified above). The next best word on the list was serai, which is another word for caravanserai or inn and is indeed on Wordle’s list of accepted words. The origin of the name is Persian and Turkish, with slightly different pronunciations (saray or sarāī, also see caravanserai). In terms of average frequency of letters and letter spots identified in our testing model, both serai and Aries have the same average frequency of letters in target word correctly identified (approximately 2.07 letters on average). However, the word serai has a slightly lower average frequency of letter spots correctly identified (approximately 0.47 compared to 0.58 for Aries). Below, you see serai used as first word on the Wordle of January 16, identifying the existence of 3 letters, with the spot of two of them correctly identified.

A Frequency Analysis on Wordle (17)

In conclusion, I am not sure if the selection of words for Wordle is a completely random process. You may argue that some words may have had some reference to daily global events (see here for a list of past Wordle words in 2022). And after all, it may not be too much fun playing based on an analysis or strategy.

(Video) Simulating Wordle: in search of the perfect strategy

Happy Wordling everyone (although Wordling is probably not on Wordle’s list of accepted words)!

FAQs

What is the letter frequency position in Wordle? ›

Over 15% of Wordle's words of the day start with S. Only six other starting letters appear in more than 5% of Wordle words. In order of frequency, they are C, B, T, P, A, and F. These starting letters might seem pretty surprising, but they are close to the order of general five-letter words.

What is the average score on Wordle? ›

Wordle has 2,308 possible answers and 12,545 allowed guesses. The global average for solving Wordle is 4.016 guesses, with Sweden being the best country (3.72) and Egypt the worst (4.42). The best word to start with is “SALET”, followed by “CRATE”, “TRACE”, “SLATE”, and “REAST”.

How do you Analyse Wordle? ›

WordleBot is a tool that will take your completed Wordle and analyze it for you. It will give you overall scores for luck and skill on a scale from 0 to 99 and tell you at each turn what, if anything, you could have done differently — if solving Wordles in as few steps as possible is your goal.

What are the odds of getting Wordle on the first try? ›

And the first result that popped up from Real Statistics Using Excel (which seemed credible) said: “Since there are 2,315 possible target words in Wordle, the probability that you will guess the target in exactly one try is 1/2315 = 0.000432.

What is the most common letter in Wordle? ›

The most common letters used in Wordle are E R A O T, according to an analysis of 221 games from Christopher Ingraham, a former Washington Post reporter. Context: Invented by Josh Wardle, a software engineer in Brooklyn, to amuse his friends and partner, Wordle has become a daily obsession for many ( 🙋).

What is the least popular letter in Wordle? ›

The least common letters in all words are the usual suspects: J, Q, Z, X, and it's unlikely any five-letter Wordle word would contain any of those characters. F, V, and K are also uncommon, but these letters have higher odds of being in one of the five possible Wordle positions.

Is 3.7 A good Wordle score? ›

Sweden is the world's best country at Wordle, with an average score of 3.72. The United States ranked No. 18 in the world for Wordle, with a national average of 3.92. The U.S. state with the best Wordle average was North Dakota, with an average of 3.65.

Has anyone ever got Wordle on the first try? ›

Based on his findings, O'Connor has determined that approximately 1% of the players who post their results to Twitter are guessing the correct word on their first attempt, and somewhere between 3% and 9% guess correctly on the second try.

What is the best Wordle score ever? ›

This European Country Has The Best Wordle Score In The World, Study Shows. According to the study by word site Word Tips, Sweden comes out on top being able to get the right answer in 3.72 guesses. (For anyone unfamiliar with how Wordle works, when it comes to scoring, the lower the better.)

What's the best strategy for Wordle? ›

Wordle: The Best Strategy For The Game
  1. 1 Try Even If It Looks Wrong.
  2. 2 Words Rarely End In S. ...
  3. 3 Cross Out One Letter At A Time. ...
  4. 4 Find Go-To Words. ...
  5. 5 Skip The Correct Letters In The Second Guess. ...
  6. 6 Remember The Statistics. ...
  7. 7 Use Distinct Words For First 2 Guesses. ...
  8. 8 Eliminate Vowels First. ...
Jul 9, 2022

What is the difference between luck and skill in Wordle? ›

The skill score determines what you did to “minimize the expected number of turns it would take to solve the puzzle.” The luck score checks if “your guesses eliminate more solutions than expected.” After that, WordleBot offers advice. WordleBot wants to make you better at Wordle.

How do you play Wordle smartly? ›

Wordle tips and tricks
  1. Avoid repeating letters that have already been marked in gray.
  2. Yellow-marked letters in the same position should be avoided so you don't lose out on your limited chances.
  3. Remember that the same letter can appear twice in a word. ...
  4. Try and focus on vowels for the first guess.
Oct 12, 2022

What percentage of people get Wordle in 2 tries? ›

So if we guess BIPED to start out with, there's a 91/2315 = 3.93% chance that we get the Wordle in two (so long as our second guess is from the 2315 word answer list). A qajaq!

What are the odds of getting a second guess on Wordle? ›

Based on the data shared by the unofficial account, Wordle Stats, of the 241,489 players who shared their Wordle results on Twitter on January 22, 2022, approximately 1% of them solved it on the first try, 3% on the second try, 17% on the third, 33% on the fourth guess, 29% on the fifth attempt, and 15% on the final .. ...

What is the number 1 most used first word in Wordle? ›

Sorry Bill Gates, but AUDIO isn't the best word to start with when you're playing Wordle. A pair of MIT researchers recently set out to find the optimal starting word for the popular online puzzle, discovering that the statistically superior first guess is SALET, which is a 15th century helmet.

What are the 3 best words to start with in Wordle? ›

If, on the other hand, you're simply trying to win within the allotted six guesses, the top three words to play are “adept,” “clamp” and “plaid.” Using any of these three words will yield an average success rate in winning the game of 98.79 percent, 98.75 percent, and 98.75 percent, respectively, if you're playing the ...

Is there a pattern to the Wordle? ›

Possible Patterns

Thus, there are 243 (= 35) possible Wordle color codes for any guess. We will call this a “pattern”, which consists of a 5-letter word using the letters “G”, “Y”, or “*”. Here, “G” stands for green, “Y” stands for yellow, and “*” for grey. The 243 possible patterns are shown in Figure 4.

What is the best first word for Wordle unlimited? ›

Some Wordle players have found success in starting with a word that has several vowels in it. “Adieu,” “audio” or “canoe,” for instance, may be good words to start with because at least three out of the five letters are vowels.

What is a good Wordle distribution? ›

If you can consistently get Wordle in three guesses, you are pretty darn good. A score of three is solidly above average, and it is certainly nothing to frown at. Especially with harder words such as “cynic,” “vivid,” or “swill,” getting it in three is very good.

What has been the most difficult Wordle? ›

The Top 25 Hardest Wordle Words of 2022 (And What They Mean)
  • trice.
  • knoll.
  • smite.
  • tacit.
  • atoll.
  • piney.
  • trope.
  • swill.
Dec 16, 2022

What is a good Wordle win percentage? ›

While most puzzles have a 99% solve rate, and even tough puzzles have a solve rate in the 80% range, today's Wordle has a solve rate of only 45%. In fact, WordleBot claims that today's puzzle has an average solve rate of 6.3 answers....which means that most players aren't getting the puzzle correct.

What percentage of players solve Wordle? ›

“Wordle” players are quite good at the game — or at least that's what they say. Seventy-four percent of players said they successfully solve the puzzle either “always” or “sometimes.” Meanwhile, 17 percent said they solve it “rarely” and only 9 percent said they “never” complete the puzzle.

What 5 letter word has the most vowels? ›

Top-Words with 5 Letters with mostly vowels
  • MIAOU. ...
  • ADIEU. ...
  • AUDIO. ...
  • AULOI. ...
  • LOUIE. ...
  • AUREI. ...
  • OURIE. ...
  • URAEI.

What was the first ever word for Wordle? ›

Wordle's original word list included ZIZEL and GOLPS.

What is the best 5-letter word for Wordle? ›

11 unusual 5-letter words to kick off your next Wordle game
  • DUCAT. ...
  • OUIJA. ...
  • CAROM. ...
  • ERGOT. ...
  • CRAIC. ...
  • SQUAB. A young unfledged bird, especially a pigeon.
  • ENOKI. An edible mushroom with a long, slender stem, a small, yellowish cap, and yellowish gills.
  • AZURE. Azure is used to describe things that are bright blue.
Feb 1, 2022

Does Wordle indicate intelligence? ›

But does being good at Wordle mean you're smarter than the average person, or even a fellow puzzler? “No,” said memory and learning researcher Aaron Seitz, a professor of psychology at the University of California, Riverside, who founded the university's Brain Game Center.

Does Wordle determine intelligence? ›

According to Little, “retrieving words and checking if they fit the Wordle clues would involve working memory, which is the ability to keep information active, manipulate, and update information. And working memory is correlated with intelligence.”

Does playing Wordle help your brain? ›

Games like Wordle are a great stimulation activity that protect brain function and help prevent dementia and cognitive decline.

Does everyone get the same word on Wordle? ›

We're now busy revamping Wordle's technology so that everyone always receives the same word. We are committed to ensuring that tens of millions of people have a gratifying and consistent experience, every day.

What is the longest winning streak on Wordle? ›

Spencer Evans was the one whose accomplishment impressed me the most: 83 unbroken wins and counting. A wordsmith himself, Evans first discovered Wordle earlier this month.

What is a good second guess for Wordle? ›

The best second word is “PLUTO”

In theory, Wordle doesn't let you guess proper names, but “pluto” is fair game for some reason. This is a great second word, and I advise you to guess it then regardless of the outcome of your first word.

How does Wordle show if a letter repeats? ›

Duplicate letters commonly use a double-letter pairing. An example of this is “knoll,” a previous Wordle answer that stumped many players. These double letters — double-L in the case of “knoll” — usually show up at the end of a word.

What is the second most common letter in Wordle? ›

We see from Figure 1 that “e” is the letter that is most frequently used (1,233 times in total), followed by “a”, “r”, “o”, “t”, “l”, “i”, “s”, “n”, “c”. The most frequently used letter in the first position is “s”, while the most frequently used letter in the second or third position is “a”.

How do you know which letters are right in Wordle? ›

A yellow tile indicates that you picked the right letter but it's in the wrong spot. The green tile indicates that you picked the right letter in the correct spot. The gray tile indicates that the letter you picked is not included in the word at all.

Which letters in Wordle are correct? ›

To solve Wordle as efficiently as possible, try words that include the letters e, t, a, i, o, n, s, h, and r; these are the most common letters in English. Another great trick is to begin with words that start with the letters t, a, o, d, and w; as again, these are the most common starting letters in English.

What is the best starting word for Wordle? ›

Sorry Bill Gates, but AUDIO isn't the best word to start with when you're playing Wordle. A pair of MIT researchers recently set out to find the optimal starting word for the popular online puzzle, discovering that the statistically superior first guess is SALET, which is a 15th century helmet.

Can there be plurals in Wordle? ›

Wordle is a popular word-guessing game from the New York Times. Wordle, the popular word-guessing game from the New York Times, will no longer use plural words, according to an announcement made last week. Over the past year, tens of millions of people tried to guess the five-letter word of the day.

Will Wordle tell you if there are two letters? ›

There is a way to see whether a Wordle solution includes a double letter via the game's established cue system. If you enter a word with two or more of the same letter as an assumption, it will be treated the same as any other attempt.

What is the rarest letter? ›

The rarest letters in English are j, q, x, and z.

Has anyone gotten the Wordle on the first try? ›

Based on his findings, O'Connor has determined that approximately 1% of the players who post their results to Twitter are guessing the correct word on their first attempt, and somewhere between 3% and 9% guess correctly on the second try.

What are the 5 most used letters in Wordle? ›

As for the letters that begin the most English words, the top five are T, O, A, W, and B. For the end letter, the most common are E, S, T, D, and N.

What letters appear most often in 5 letter words? ›

This means the top most commonly used letters in 5-letter words (in terms of total frequency as well as average frequency) were the letters A, E, S, O, R, I, L, T, etc. I decided to focus on the top six letters since the average frequency dropped significantly after the sixth letter.

What are the odds of getting Wordle on the second try? ›

While the figure of 6.5% assumes perfect play, if you're picking any reasonably sensible first guess, and then something consistent with that guess on your second try, your odds are still over 4%.

What is the longest streak on Wordle? ›

Spencer Evans was the one whose accomplishment impressed me the most: 83 unbroken wins and counting. A wordsmith himself, Evans first discovered Wordle earlier this month.

Videos

1. Answering Wyn Hopkins Wordle Challenge
(MrExcel.com)
2. L06 – Wordle and Word Graphs
(AuditNet LLC)
3. Wordle Data Analysis
(Stockton Cannon)
4. How Wordle can Make You a Better Poker Player
(Poker Giraffe)
5. What is the best word to start Wordle with
(Jadi)
6. Using Wordle to Analyze Text
(Kristi Reitz)
Top Articles
Latest Posts
Article information

Author: Msgr. Refugio Daniel

Last Updated: 03/29/2023

Views: 5293

Rating: 4.3 / 5 (74 voted)

Reviews: 81% of readers found this page helpful

Author information

Name: Msgr. Refugio Daniel

Birthday: 1999-09-15

Address: 8416 Beatty Center, Derekfort, VA 72092-0500

Phone: +6838967160603

Job: Mining Executive

Hobby: Woodworking, Knitting, Fishing, Coffee roasting, Kayaking, Horseback riding, Kite flying

Introduction: My name is Msgr. Refugio Daniel, I am a fine, precious, encouraging, calm, glamorous, vivacious, friendly person who loves writing and wants to share my knowledge and understanding with you.