- Overview:
- In this project, 8 Million Github user's activity data was collected, filtered, and analyzed. The user profiles are modelled, and the typical user behavior patterns are identified by applying unsupervised clustering on the follow network.
- Part 1:
- 8 Million Github user's activity data was extracted from historical user activity database on Google BigQuery platform
- Follow network are built using user's activity data
- Various topological features and aggregated behavior features were calculated using social network analysis techniques, and the user profolios were generated
- Part 2:
- User profolio data was carefully explored and cleaned.
- Un-supervised culstering was executed to identify typical user behavior patterns of Github users.
-
Notifications
You must be signed in to change notification settings - Fork 0
gigglegrig/MIE1512-Data-Analytics
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Data analytics project on large scale social-network analysis
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published