.. | ||
clustering | ||
feature_vectors | ||
god_classes | ||
.gitignore | ||
extract_feature_vectors.py | ||
find_god_classes.py | ||
ground_truth.py | ||
hierarchical.py | ||
k_means.py | ||
keyword_list.txt | ||
README.md | ||
requirements.txt | ||
silhouette.py |
IMA - God classes project
Xerces2
- Download: https://dlcdn.apache.org//xerces/j/source/Xerces-J-src.2.12.2.zip
- extract and rename folder to
xerces2
Running
Part 3: clustering and silhouette metric
To compute optimal k-means and agglomerative clusterings using silhouette validation for all classes run:
./silhouette.py --validate --autorun
To compute k-means or agglomerative clustering for a specific number of
clusters K
and a specific class KLASS
run respectively:
./k_means.py KLASS K
./hierarchical.py KLASS K
Then, to check their silhouette metric run:
./silhouette.py
Install dependencies
# create venv
python -m venv env
source env/bin/activate
pip3 install -r requirements.txt