Probstat/random word

จาก Theory Wiki
ไปยังการนำทาง ไปยังการค้นหา

Download words.txt which is a list of common 4325 English words. (I obtain this list from [1], which took it from [2]. The original list of 5000 words contain duplicates because some word appears in many parts of speech. I also removed words with punctuation marks (e.g., o'clock).)

Random experiment: Pick one random word from the list.

Probability

Compute the probabilities of the following events.

  • Pick a word with an alphabet 'a'.
  • Pick a word witn an alphabet 'z'.
  • Pick a word with both alphabet 'a' and alphabet 'z'.
  • Pick a word with an alphabet 'b'.
  • Pick a word with alphabets 'a' and 'b'.
  • Pick a word that contains 'a' or 'b'. (Can you find this one without running a program?)
  • Pick a word that contains 'na'.
  • Pick a word that contains 'nal'.
  • Pick a word that contains more than 3 vowels.

Conditional probability

Compute the conditional probabilities of the following events.

Word prediction