A data collection and processing tool for gathering name statistics from various national databases.
- USA Social Security Administration (SSA) Names Database
- Spain National Statistics Institute (INE) Names Database
.
├── names_data_sources/
│ ├── USA_names_ssa/ # US Social Security Administration name data
│ └── Spain_names_ine/ # Spanish INE name data
└── requirements.txt # Project dependencies
- Create a virtual environment using uv:
uv venv
source .venv/bin/activate # On Unix/macOS
# or
.venv\Scripts\activate # On Windows
- Install dependencies:
uv pip install -r requirements.txt
Each data source has its own set of scripts for downloading and processing data. Navigate to the specific data source directory and run the main script:
cd names_data_sources/USA_names_ssa
uv run main.py
- USA Names: Data obtained from the U.S. Social Security Administration (www.ssa.gov)
- Spain Names: Data obtained from Instituto Nacional de Estadística (www.ine.es)