This Streamlit application provides an automated solution for data analysis and visualization. It allows users to upload CSV files, perform basic data preprocessing, generate visualizations, and even train simple machine learning models.
- Data upload via CSV file
- Basic data information display
- Handling of missing values
- Data normalization and standardization
- Feature engineering (polynomial features)
- Data visualization:
- Histograms
- Correlation heatmaps
- Scatter plots
- Feature importance calculation
- Machine Learning model training and evaluation
- Automated report generation
- Clone this repository:
git clone https://github.com/saksham-jain177/Automated-Data-Analysis-and-Visualization.git
- Navigate to the project directory:
cd Automated-Data-Analysis-and-Visualization
- Install the required packages:
pip install -r requirements.txt
Run the Streamlit app:
streamlit run auto.py
Then, follow these steps:
- Upload your CSV file
- Choose options for handling missing values and data normalization
- Explore the generated visualizations
- Optionally, train and evaluate machine learning models
- Download the automated report
- streamlit
- pandas
- matplotlib
- seaborn
- plotly
- scikit-learn
For a complete list of dependencies, see requirements.txt
.
Contributions to this project are welcome! Feel free to submit pull requests or open issues to suggest improvements or report bugs.
This project is open source and available under the MIT License.
This project was created as a tool for automated data analysis and visualization. It's designed to be a starting point for data exploration and can be extended with additional features and capabilities.
Any changes and improvements are welcome! If you have ideas to enhance this tool or find any issues, please don't hesitate to contribute or reach out.