Hi,
I've got a question regarding exercises 2.1 and 2.2 from the newest exercise sheet.
As mentioned in the task description, every word should be treated as a distinct feature, but how does that affect the calculation of the probabilities? For example, with P(Macao | Yes), is the count of "Macao" 4 (because the actual word count is 4) or 3 (because there are 3 features with "Macao")? And what would the corresponding denominator be in each case: 7, because there are 7 words in total for "Yes", or 3, because the actual count of "Yes" is 3?
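To make the two readings concrete, here is a minimal sketch of how I would compute each one. It uses only the counts from my question; the variable names are my own, and I am assuming no smoothing, which the exercise may or may not require. I am also assuming the numerators and denominators pair up as shown (word counts with word counts, "Yes" counts with "Yes" counts).

```python
from fractions import Fraction

# Counts taken from the question above; all names are hypothetical.
macao_words_in_yes = 4   # occurrences of "Macao" among the "Yes" words
total_words_in_yes = 7   # all words belonging to "Yes"
yes_with_macao = 3       # "Yes" entries that contain "Macao" (my reading of "3 features")
yes_total = 3            # the actual count of "Yes"

# Reading 1: count every occurrence of the word (no smoothing assumed).
p_word_count = Fraction(macao_words_in_yes, total_words_in_yes)

# Reading 2: count each "Yes" entry at most once (no smoothing assumed).
p_entry_count = Fraction(yes_with_macao, yes_total)

print(p_word_count)   # 4/7
print(p_entry_count)  # 1
```

So the question is essentially whether P(Macao | Yes) should come out as 4/7 or as 3/3.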
Thank you for your help.
Kind regards,
Raphael Teller