Skip to content

clamp-gen/kmedoids

Repository files navigation

How to create AMP prototypes based on e_value obtained from Diamond

  • Step 1: Run python diamond_distance_matrix.py to get the pair-wise distance between all AMP pairs in dataset D1.csv (There is precomputed diamond_distance_matrix in file diamond_distance_matrix.zip for step 1, so you can skip step1)
  • Step 2: Run python diamond_get_amp_prototype.py to write prototypes in file amp_prototype_diamond_distance.txt

kmedoids

Precomputed distance metric on our current set D1 (used for the real oracle) accessible here

To run: python k_medoids_better.py --n_medoids 500

  • This will load the default distance matrix built on the full D1 set (real oracle) - the list of medoids (initially 500), and the full list of associated neighbors (clusters members)
    • If n_medoids != 500, the distance matrix will be used and new set of k-medoids will be created
    • The 500 k medoids (protoypes) are available here
    • The set of members of the cluster of each medoid is available here

The HackMD note summarizing the approach is here

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages