In our paper, we show that large language models represent categorical concepts as simplices and hierarchical relations between concepts as orthogonality.
We validate the theory on Gemma-2B representations, and this repository provides the code for those experiments.
The files animals.json and plants.json contain sets of words generated by ChatGPT-4.
The *_gemma.json and *_graph.adjlist files in data contain the WordNet word collections and hypernym hierarchy graphs for nouns and verbs; they are generated by get_wordnet_hypernym.ipynb, along the lines of the sketch below.
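For reference, here is a minimal sketch of this kind of extraction using nltk and networkx; the exact filtering in get_wordnet_hypernym.ipynb may differ, and the output filename below is illustrative.

```python
# Minimal sketch: build a WordNet noun hypernym graph with nltk + networkx.
# The exact filtering used in get_wordnet_hypernym.ipynb may differ; the
# output filename below is illustrative.
import networkx as nx
import nltk
from nltk.corpus import wordnet as wn

nltk.download("wordnet")  # fetch the WordNet corpus if not already present

G = nx.DiGraph()
for synset in wn.all_synsets(pos="n"):  # use pos="v" for the verb hierarchy
    for hypernym in synset.hypernyms():
        # Edge from parent (hypernym) to child (hyponym).
        G.add_edge(hypernym.name(), synset.name())

nx.write_adjlist(G, "noun_graph.adjlist")  # illustrative filename
```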
To run the code, install the Python packages transformers, networkx, scikit-learn, nltk, inflect, torch, numpy, seaborn, matplotlib, and tqdm (json is part of the Python standard library). A GPU is helpful for running the code efficiently.
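For example, with pip:

```bash
pip install transformers networkx scikit-learn nltk inflect torch numpy seaborn matplotlib tqdm
```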
Before running the Jupyter notebooks, run store_matrices.py to store the unembedding vectors.
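A typical invocation (assuming the script needs no additional arguments) is:

```bash
python store_matrices.py
```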
1_Animal.ipynb: We display the 2D plots in Figure 2 and the 3D plots in Figure 3.
2_Noun_Test.ipynb: We validate the existence of the vector representations for each feature in the WordNet noun hierarchy (Figure 4).
3_Noun_Heatmap.ipynb: We confirm that the hierarchical relations in the WordNet noun hierarchy are encoded as orthogonality (Figure 5). We also zoom into the heatmaps (Figure 8) for the subtree shown in Figure 7. A minimal illustration of such a heatmap appears after this list.
4_Verb_Test.ipynb: We validate the existence of the vector representations for each feature in the WordNet verb hierarchy (Figure 9).
5_Verb_Heatmap.ipynb: We confirm that the hierarchical relations in the WordNet verb hierarchy are encoded as orthogonality (Figure 10).
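To illustrate the kind of orthogonality check the heatmap notebooks perform, here is a minimal sketch; the random vectors stand in for the actual feature representations computed in the notebooks.

```python
# Minimal sketch of an orthogonality heatmap: pairwise cosine similarities
# between feature vectors, plotted with seaborn. The random vectors below are
# placeholders for the actual feature representations from the notebooks.
import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns

rng = np.random.default_rng(0)
features = rng.standard_normal((10, 2048))  # placeholder: 10 feature vectors

# Normalize rows, then take inner products to get cosine similarities.
normalized = features / np.linalg.norm(features, axis=1, keepdims=True)
cosine_sim = normalized @ normalized.T

sns.heatmap(cosine_sim, cmap="coolwarm", vmin=-1, vmax=1, square=True)
plt.title("Pairwise cosine similarity between feature vectors")
plt.show()
```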