Skip to content

Contains the code to crawl the Anac dataset and jupyter notebooks for explorative analysis

Notifications You must be signed in to change notification settings

fabiana001/anacAnalysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

90 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ANAC analysis

This project allow us to extract Anac dataset, to realize some descriptive analysis, apply machine learning algorithms, and generate a graph of semantic relationships among public administrations and private companies.

We present this project at Data Driven Conference. Here the presentation.

The project in organized in three main sections:

  • crawler: java project for crawling data. Starting from a json file containing information about all public administration, it extract the dataset used for our analysis.
  • notebooks: python project used for analyzing the ANAC dataset extracted by the crawling process.
  • web_api: a docker-composer file which runs a Neo4j instance and a React web-app and visualizes the semantic graph G(V, E), where the nodes in V are public administrations and private companies contained in the ANAC dataset and the edges E are the semantic relationships among nodes.

This is a prototype, so feel free to improve this project o suggest us alternative solutions.

About

Contains the code to crawl the Anac dataset and jupyter notebooks for explorative analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published