Data Science for Humanities 2¶

Session: Corpus Linguistics¶

Summer term 25¶

Prof. Goran Glavaš, Lennart Keller¶

Multi-word expressions, collocations, idioms¶

We can think of language as a composite of two things:

  • Lexicon: A collection of units that we use to reference and describe
    • Entities
    • States
    • Transitions
  • Grammar: A set of rules for combining the elements of the lexicon in a meaningful way

Lexical semantics: modeling/capturing the meaning of words




Usually, we assume that the atomic lexicon units are words, but some semantic units span multiple terms.

Lexical association measures help us find words that frequently occur together.

But how can we describe those word sequences?

Multi-word expressions¶

Semantic units spanning multiple words are called multi-word expressions (MWEs).

[Bender and Lascarides, 2020] broadly define MWEs as:

"collections of words which co-occur and are idiosyncratic in one or more aspects of their form or meaning"

Forms of MWEs include:

  • Words that co-occur much more frequently than by chance (i.e., they are lexically associated)
    • Examples: [killer whale], [flight attendant], [cruise missile]
  • Terms that, in combination, disobey grammatical or syntactic rules
    • Examples: [long time no see], [all of a sudden]
  • Words whose combined meaning is more than the sum of their parts
    • Examples: [greenhouse effect], [jump the shark]

Collocations¶

A sub-type of MWEs.

(Choueka, 1988)

[A collocation is defined as] “a sequence of two or more consecutive words, that has characteristics of a syntactic and semantic unit, and whose exact and unambiguous meaning or connotation cannot be derived directly from the meaning or connotation of its components."

Criteria:

  • non-compositionality
  • non-substitutability
  • non-modifiability

Collocations are words that frequently appear together and display a semantic association.

Examples: [strong coffee], [make progress], [do homework], [from my point of view]

Collocations may specify word senses¶

Collocations tend to make word senses less ambiguous.

Example: heavy has many senses, and its precise meaning is often defined by its adjacent noun.

  • He is a [heavy drinker].

  • The final punch dealt them a [heavy blow].

Non-Compositionality¶

The meaning of compositional phrases can be predicted from the meaning of their parts.

Collocations have limited compositionality:

[bull market] $\neq$ [bull] + [market]

Idioms¶

Idioms are collocations with the highest degree of non-compositionality. They display a figurative and non-literal meaning.

Metaphors, metonymy, or other figurative devices convey their sense.

Examples:

  • [It struck me] [out of the blue]; the company was [behind all of] the accidents.
  • [Der Teufel steckt im Detail] ("The devil is in the detail").

Locality of idioms

Idioms are often specific to a particular region or dialect.

  • [Butter bei die Fische] (northern German: "get to the point")
  • [Grüß Gott] (southern German greeting, literally "greet God")
  • ...

Non-Substitutability¶

A good way to test whether an MWE is a collocation is to exchange one of its parts with a synonym and check if it sounds off:

  • Collocation: [strong coffee] $\rightarrow$ [powerful coffee] 😵‍💫
  • Non-collocation: [bad weather] $\rightarrow$ [poor weather] 😊

Since collocations are language specific, their correct usage indicates fluency in a language.

Often, incorrect usage of collocations is what makes non-native speakers sound unnatural.

  • [Ein Beispiel für] $\rightsquigarrow$ [An example for]
  • [Ein Beispiel für] $\rightarrow$ [An example of]

Non-modifiability¶

Many collocations cannot be freely modified with additional lexical material or through grammatical transformations.

  • [weapons of mass destruction] $\neq$ [weapons of massive destruction]
  • [to be fed up to the back teeth] $\neq$ [to be fed up to the teeth in the back]

Lexical association measures¶

Goal: Find words frequently occurring in the same context (-> collocations).

Finding those word pairs is often framed as a statistical test:

  • $H_0$: Words $x, y$ aren't related.
  • $H_1$: Words $x, y$ are related.

$\rightarrow$ Goal: Find all pairs $(x, y)$ for which $H_1$ is more likely than $H_0$


While there are numerous tests available, today we'll focus only on PMI.

Pointwise Mutual Information (PMI)¶

Church and Hanks, 1989

$$ PMI(x, y) = \log_2\frac{P(x, y)}{P(x) \cdot P(y)} $$

Intuition: How much more likely is it for two words to co-occur than we would expect by chance?




Let's break it down:

$P(x)$, $P(y) \rightarrow$ Probability of word $x$ / $y$ occurring in our corpus.


Denominator: $P(x) \cdot P(y) \rightarrow$ Estimated joint probability of $x$ and $y$ co-occurring under the assumption that they are statistically independent.


Numerator: $P(x, y) \rightarrow$ Observed joint probability of words $x$ and $y$ occurring together in our corpus.


Log-Part: $\log_b (\frac{x}{y}) = \log_b x - \log_b y$

$\rightarrow$ Positive values indicate these words occur more frequently than we would expect by chance!
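As a minimal sketch (using a made-up miniature corpus and treating adjacent words as co-occurring), PMI can be estimated directly from unigram and bigram counts:

```python
from collections import Counter
from math import log2

# Toy corpus, invented for illustration; adjacency = co-occurrence.
tokens = ("strong coffee is strong and strong tea is hot "
          "strong coffee keeps me awake").split()

unigrams = Counter(tokens)                  # for estimating P(x), P(y)
bigrams = Counter(zip(tokens, tokens[1:]))  # for estimating P(x, y)

def pmi(x, y):
    p_x = unigrams[x] / sum(unigrams.values())
    p_y = unigrams[y] / sum(unigrams.values())
    p_xy = bigrams[(x, y)] / sum(bigrams.values())
    return log2(p_xy / (p_x * p_y))

# Positive value: "strong coffee" co-occurs more often than chance predicts.
print(f"PMI(strong, coffee) = {pmi('strong', 'coffee'):.2f}")
```

On a real corpus the counts would come from millions of tokens; with such tiny counts the estimates are, of course, extremely noisy.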


Co-occurrence¶

How to define together?

Co-occurrence is loosely defined; in theory, we can freely express our notion of togetherness.

For example, words that appear in the same:

  • phrase
  • sentence
  • paragraph
  • window of $n$ words
  • ...

Another option is to define together as a collocation, i.e., words must be adjacent.
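One of the notions listed above, co-occurrence within a window of $n$ words, can be sketched as follows (an illustrative toy function; tokenization and normalization are ignored):

```python
from collections import Counter

# Count ordered co-occurrence pairs within a window of n following words.
def window_cooccurrences(tokens, n=2):
    pairs = Counter()
    for i, x in enumerate(tokens):
        for y in tokens[i + 1 : i + 1 + n]:
            pairs[(x, y)] += 1
    return pairs

tokens = "he made good progress and made progress fast".split()
pairs = window_cooccurrences(tokens, n=2)
# "made ... progress" is caught both when adjacent and at distance 2.
print(pairs[("made", "progress")])  # -> 2
```

Setting `n=1` recovers the strict adjacency notion used for collocations.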


PMI Properties¶

PMI values can range from $-\infty$ to $\infty$.

Positive values indicate that $x$ and $y$ co-occur more often than we would expect.

The interpretation of negative PMI values is much vaguer: the words co-occur less often than expected.

$\rightarrow$ Does this mean the words are more than unrelated, i.e., that they actively avoid each other?

Can we draw any conclusion based on this information?

You shall know a word by the company it keeps.
(Firth's distributional hypothesis)

But also on the company it rejects?

Empirical observation: Negative PMI values are harmful to downstream applications.

To avoid this negative influence, Positive Pointwise Mutual Information (PPMI) is used.

$$ PPMI(x, y) = \max(PMI(x, y), 0) $$
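A small sketch of PPMI over a made-up co-occurrence matrix (the counts are invented; in practice they would come from a corpus):

```python
import numpy as np

# Toy symmetric co-occurrence matrix: cell (i, j) counts how often
# word i co-occurs with word j (counts are made up for illustration).
counts = np.array([[0, 8, 1],
                   [8, 0, 1],
                   [1, 1, 0]], dtype=float)

p_xy = counts / counts.sum()               # observed joint probabilities
p_x = counts.sum(axis=1) / counts.sum()    # marginal word probabilities

with np.errstate(divide="ignore"):         # log2(0) -> -inf, clipped below
    pmi = np.log2(p_xy / np.outer(p_x, p_x))

ppmi = np.maximum(pmi, 0)                  # PPMI(x, y) = max(PMI(x, y), 0)
print(ppmi.round(2))
```

All zero-count cells (PMI of $-\infty$) and negative PMI cells end up at 0, so only positive association survives.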

Spatial detection of collocations¶

Intuition:

Words that often or even only occur at the same distance from one another are candidates for collocations.

How to find those pairs?

Use variance: Word pairs with low variance in offset might be collocations.

Offset: signed distance between the two words.

$$ \frac{\sum_{i=1}^{n} (d_i - \bar{d})^2}{n - 1} $$

  • $d_i \rightarrow$ Offset of the $i$-th co-occurrence of the word pair
  • $\bar{d} \rightarrow$ Mean of all offsets
  • $n \rightarrow$ Number of observed co-occurrences of the pair
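A minimal sketch of this idea, with hypothetical offset observations: Python's `statistics.variance` computes exactly the sample variance defined above (division by $n - 1$).

```python
from statistics import mean, variance  # variance = sample variance, / (n - 1)

# Hypothetical signed offsets for two word pairs observed in a corpus.
offsets_collocation = [3, 3, 4, 3, 5, 3]     # stable offset: collocation candidate
offsets_random_pair = [-2, 7, 1, -4, 9, -3]  # unrelated words: offsets scatter

print(mean(offsets_collocation), variance(offsets_collocation))
print(mean(offsets_random_pair), variance(offsets_random_pair))
```

The first pair has low offset variance and would be flagged as a candidate; the second pair's offsets are all over the place, so it would be discarded.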

Lexico-semantic resources: WordNet, BabelNet, PanLex¶

So far, we have only learned to extract possible candidates for MWEs from texts.

But to achieve this, we rely only on word or co-occurrence frequencies, which tell us nothing about the meaning of the terms we extract.

In general, any statistical method to extract relations from texts cannot infer the nature of the relatedness; it can only tell us that there is some relationship between two terms.

Consider the following examples:

"You were [right]", I said, "We should have turned [right] three blocks ago."
After [serving] four years as ambassador to Germany, she returned to Louisville.
The night before her departure, she invited former colleagues to a well-known restaurant in Berlin, which is famous for [serving] Wiener Schnitzel.

It's obvious that right and to serve each have two different senses across these utterances.

Depending on our corpus, association measures could reveal that "be right" and "turn right" are strongly related. Still, even with modern, context-aware techniques like BERT, we couldn't ground these two usages in their appropriate senses.

WordNet - Capturing word senses¶

We have to rely on external and manually created resources to infer word senses.

WordNet is the most famous example of a lexico-semantic resource. (And it's freely available for everyone!)

It contains ~120,000 nouns, ~22,000 adjectives, ~12,000 verbs, and ~4,000 adverbs in English, and stores one or multiple senses for each entry.

Furthermore, WordNet stores lexicographic relations among words and groups words into lexicographic categories.

The foundational relation in WordNet is the synonym relation.

Synsets¶

Words are stored in synsets, clustering near-synonymous terms into the same group.

Additionally, synsets are labeled with lexicographic categories, like ANIMAL, PERSON, QUANTITY, ...

These categories are called supersenses.

Sense Relations¶

WordNet connects different synsets via edges labeled with the type of semantic relation between them.

Some examples for noun relations:

| Relation | Definition | Example |
| --- | --- | --- |
| Hypernym | Concept to superordinate | breakfast -> meal |
| Hyponym | Concept to subtype | meal -> lunch |
| Instance [Hyper-/Hypo]nym | ... | Goethe -> author |
| Part Meronym | From whole to parts | forest -> tree |
| Part Holonym | From parts to whole | soldier -> army |
| Antonym | Semantic opposition | cold <-> warm |
| ... | ... | ... |

WordNet - A lexico-semantic Knowledge Graph¶

In its entirety, WordNet can be seen as a knowledge graph bridging English words via various lexico-semantic links.

WNGraph.png

How to use WordNet?¶

WordNet is integrated into the Natural Language Toolkit - a legacy Python NLP library.

In [1]:
from nltk.corpus import wordnet as wn

term, pos = "love", "n"
synsets = wn.synsets(term, pos)
for idx, synset in enumerate(synsets):
    print(f"{idx + 1}. {term}: {synset.definition()}")
    print(f"Synset: {synset}")
    print(f"Examples: {synset.examples()}")
    print(f"Hyponyms: {synset.hyponyms()[:5]}")
    print(f"Hypernyms: {synset.hypernyms()[:5]}")
    print("_"*30 + "\n")

newline = "\n"
print(f"Synonyms for {term}: {newline.join([' '.join(s) for s in wn.synonyms(term) if s])}")
1. love: a strong positive emotion of regard and affection
Synset: Synset('love.n.01')
Examples: ['his love for his work', 'children need a lot of love']
Hyponyms: [Synset('agape.n.01'), Synset('agape.n.02'), Synset('amorousness.n.01'), Synset('ardor.n.02'), Synset('benevolence.n.01')]
Hypernyms: [Synset('emotion.n.01')]
______________________________

2. love: any object of warm affection or devotion
Synset: Synset('love.n.02')
Examples: ['the theater was her first love', 'he has a passion for cock fighting']
Hyponyms: []
Hypernyms: [Synset('object.n.04')]
______________________________

3. love: a beloved person; used as terms of endearment
Synset: Synset('beloved.n.01')
Examples: []
Hyponyms: []
Hypernyms: [Synset('lover.n.01')]
______________________________

4. love: a deep feeling of sexual desire and attraction
Synset: Synset('love.n.04')
Examples: ['their love left them indifferent to their surroundings', 'she was his first love']
Hyponyms: []
Hypernyms: [Synset('sexual_desire.n.01')]
______________________________

5. love: a score of zero in tennis or squash
Synset: Synset('love.n.05')
Examples: ['it was 40 love']
Hyponyms: []
Hypernyms: [Synset('score.n.03')]
______________________________

6. love: sexual activities (often including sexual intercourse) between two people
Synset: Synset('sexual_love.n.02')
Examples: ['his lovemaking disgusted her', "he hadn't had any love in months", 'he has a very complicated love life']
Hyponyms: []
Hypernyms: [Synset('sexual_activity.n.01')]
______________________________

Synonyms for love: passion
beloved dear dearest honey
erotic_love sexual_love
love_life lovemaking making_love sexual_love
enjoy
bang be_intimate bed bonk do_it eff fuck get_it_on get_laid have_a_go_at_it have_intercourse have_it_away have_it_off have_sex hump jazz know lie_with make_love make_out roll_in_the_hay screw sleep_together sleep_with
In [2]:
# Get the similarity between synsets to measure their semantic relatedness.
# The similarity is based on the shortest path between synsets.
synset_a = wn.synsets("game", "n")[0]
synset_b = wn.synsets("win", "n")[0]
synset_c = wn.synsets("newspaper", "n")[0]

print("Definitions:")
print(*map(lambda s: f"{s.lemmas()[0].name()}: {s.definition()}", (synset_a, synset_b, synset_c)), sep="\n")
print()
print("Similarities:")
print(f"game -> win {synset_a.path_similarity(synset_b)}")
print(f"game -> newspaper {synset_a.path_similarity(synset_c)}")
Definitions:
game: a contest with rules to determine a winner
win: a victory (as in a race or other competition)
newspaper: a daily or weekly publication on folded sheets; contains news and articles and advertisements

Similarities:
game -> win 0.125
game -> newspaper 0.0625
In [3]:
# ONLY FOR EDUCATIONAL PURPOSES; DO NOT USE IT AS IS FOR ANYTHING!

# A naive function that maps each word in a sentence to its hypernym,
# to obtain a more abstract (and less precise) version of the sentence.
def get_hypernyms(sentence, n):
    """
    Replaces each noun and verb with the nth entry of its hypernym closure;
    adjectives and untagged tokens are passed through unchanged.
    (A negative n indexes from the most abstract hypernyms.)
    """
    hypernym_sent = []
    for token, pos in sentence:
        # WordNet does not contain hypernyms for adjectives
        if pos is None or pos == wn.ADJ:
            hypernym_sent.append((token, pos))
        else:
            synset = wn.synsets(token, pos)[0]
            hypernyms = list(synset.closure(lambda s: s.hypernyms()))
            if hypernyms:
                try:
                    hypernym = hypernyms[n]
                    token = hypernym.lemmas()[0].name()
                except IndexError:
                    hypernym = hypernyms[0]
                    token = hypernym.lemmas()[0].name()
            hypernym_sent.append((token, pos))
            
    return hypernym_sent

sentence_a = [("I", None), ("adore", wn.VERB), ("my", None), ("dogs", wn.NOUN), ("and", None), ("cats", wn.NOUN)]
sentence_b = [("He", None), ("was", wn.VERB), ("attacked", wn.VERB), ("by", None), ("a", None), ("lion", wn.NOUN)]

print(get_hypernyms(sentence_a, n=-13))
print(get_hypernyms(sentence_b, n=-13))
[('I', None), ('love', 'v'), ('my', None), ('domestic_animal', 'n'), ('and', None), ('feline', 'n')]
[('He', None), ('was', 'v'), ('contend', 'v'), ('by', None), ('a', None), ('feline', 'n')]
/opt/homebrew/Caskroom/miniconda/base/envs/python_intro/lib/python3.11/site-packages/nltk/corpus/reader/wordnet.py:604: UserWarning: Discarded redundant search for Synset('animal.n.01') at depth 7
  for synset in acyclic_breadth_first(self, rel, depth):

Going beyond English: BabelNet¶

BabelNetWeb.png

Basics¶

  • BabelNet is a large-scale semantic network and multilingual lexicalized knowledge base.
  • It combines resources like WordNet, Wikipedia, and OmegaWiki, covering 284 languages.

Multilingualism¶

  • BabelNet's goal is to align concepts and senses across languages.
  • In doing so, it offers translations and cross-lingual semantic relationships.
Semantic Network¶

  • Structured as a graph (very similar to WordNet)
  • Each node represents a concept (synset) or a named entity.
  • Edges represent semantic relationships (e.g., hypernymy, meronymy).

PanLex - Preserving endangered languages¶

Basics¶

  • Large-scale multilingual lexical database, covering over 17,000 languages and dialects.
  • PanLex contains around 1.3 billion translation pairs.

Comprehensive Language Coverage¶

  • Goal: Represent both major and minor languages, with a particular focus on supporting endangered and lesser-known languages.

Translation Pairs¶

  • Direct translations between languages.
  • Indirect translations through intermediary languages.