Skip to content

jonathanwvd/awesome-industrial-datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Awesome Industrial Datasets

🔗 Check the HTML version for better navigation.

Welcome to the Awesome Industrial Datasets repository! This project aims to simplify the access to high-quality industrial datasets across various sectors such as chemical, mechanical, oil and gas, and more. These datasets are invaluable for researchers, engineers, and data scientists working on machine learning models and other analytical tasks that require real-world industrial data.

If you find this repository useful, please consider giving it a ⭐ to show your support!

🤝 If you're interested in contributing, please refer to the Contribution Guidelines.

Datasets Table

Dataset Name Labeled Time Series Simulation Additional Tags
3W Yes Yes Both Oil and Gas; Real events; Fault detection; Multivariate data; Sensor data; Time-series analysis; Oil wells; Machine learning benchmark
Ai4I 2020 Predictive Maintenance Dataset Yes Yes Yes Predictive maintenance; Synthetic data; Industry 4.0; Machine failure; Time-series data
Aps Failure At Scania Trucks Yes No No Scania trucks; APS system; Component failure; Predictive maintenance; Industrial data
Additional Tennessee Eastman Process Simulation Data Yes Yes Yes Anomaly detection; Process simulation; Human-automated systems; Operational research; Fault analysis
Air Quality Yes Yes No Air quality monitoring; Sensor data; Pollution levels; Time-series analysis; Environmental data
Appliances Energy Prediction No Yes No Indoor environment monitoring; ZigBee wireless network; Temperature data; Humidity data; Weather integration; Energy consumption; M-bus energy meters; Airport weather station
Beijing Pm2 5 Data Yes Yes No Air quality; PM2.5 concentration; Meteorological data; Environmental monitoring; Time-series data
Bosch Production Line Performance Yes No No Manufacturing; Production line; Quality control; Industrial data; Machine learning
Brent Oil Prices Yes Yes No Brent oil; Crude oil prices; Economic indicators; Market analysis; Financial markets
Business And Industry Reports Yes No No Economic data; Industry reports; Business statistics; Market analysis; U.S. Census data
C Mapss Aircraft Engine Simulator Data Yes Yes Yes Aircraft engine; Simulator data; Engine performance; Sensor data; Prognostics
C-Mapss Aircraft Engine Simulator Data Yes Yes Yes Aircraft engine; Simulator data; Engine performance; Sensor data; Prognostics
Cmapss Jet Engine Simulated Data Yes Yes Yes Jet engines; Simulation data; Prognostics; Health management; Aerospace engineering
Cnc Mill Tool Wear Yes Yes No CNC milling; Tool wear detection; Predictive maintenance; Operational efficiency; Industrial sensors
Car Evaluation Yes No No Automobile evaluation; Decision-making; Categorical data; Multivariate data; Classification task
Casting Product Image Data For Quality Inspection Yes No No Quality inspection; Metal casting; Image classification; Defect detection; Manufacturing process
Chemical Composition Of Ceramic Samples Yes No No Ceramic analysis; Chemical composition; X-ray fluorescence; Material science; Historical ceramics
Chemical Production India 2013 To 2020 Yes Yes No Chemical industry; Production data; India; Economic analysis; Time-series forecasting
Civil Engineering Cement Manufacturing Dataset Yes No No Civil engineering; Cement composition; Concrete strength; Structural engineering; Material science
Combined Cycle Power Plant Yes No No Power plant; Energy output; Regression tasks; Environmental data; Multivariate data
Concrete Compressive Strength Yes No No Civil engineering; Material properties; Concrete strength; Regression data; Multivariate data
Concrete Crack Images For Classification Yes No No Concrete; Crack detection; Image classification; Deep learning; Structural health monitoring
Condition Based Maintenance Of Naval Propulsion Plants Yes No No Naval propulsion; Condition-based maintenance; Gas turbine; Simulation data; Performance decay
Condition Monitoring Of Hydraulic Systems Yes Yes No Hydraulic systems; Condition monitoring; Sensor data; Time-series data; Mechanical systems
Data Driven Prediction Of Battery Cycle Life Before Capacity Degradation Yes Yes No Battery life prediction; Lithium-ion batteries; Charge-discharge cycles; Predictive maintenance; Energy storage
Detecting Anomalies In Wafer Manufacturing Yes No No Wafer manufacturing; Sensor data; Defect detection; Anomaly detection; Manufacturing quality
Eco Dataset Yes Yes No Electricity consumption; Occupancy detection; Smart meters; Energy efficiency; Household data
Electrical Grid Stability Simulated Data Yes No Yes Electrical grid; Stability analysis; Smart grid; Simulation data; Physics
Electricity Load Diagrams 2011 2014 Yes Yes No Electricity consumption; Time-series data; Energy monitoring; Smart grid; Urban energy use
Energy Efficiency Yes No Yes Energy efficiency; Building simulation; Heating load; Cooling load; Environmental data
Greend Yes Yes No Energy consumption; Household data; Smart grids; Time-series; Open data
Gas Sensor Array Drift At Different Concentrations Yes Yes No Gas sensors; Sensor drift; Chemical sensing; Time-series data; Environmental monitoring
Gas Sensor Array Temperature Modulation Yes Yes No Gas sensing; MOX sensors; Temperature modulation; Environmental monitoring; Sensor arrays
Gas Sensor Array Under Dynamic Gas Mixtures Yes Yes No Gas sensors; Dynamic mixtures; Sensor data; Time-series data; Chemical sensors
Gas Sensor Arrays In Open Sampling Settings Yes Yes No Gas sensing; Sensor arrays; Environmental monitoring; Chemical detection; Open sampling settings
Genesis Demonstrator Data For Machine Learning Yes Yes INA Robotics; Automation; Sensor data; Machine learning; Industrial systems
Global Power Plant Dataset Yes No No Global energy; Power plants; Renewable energy; Energy statistics; Environmental impact
Green House Gas Produce By Different Industry Yes No No Environmental data; Greenhouse gases; Industry emissions; Sustainability; Emission tracking
High Storage System Data For Energy Optimization Yes Yes INA Energy optimization; High storage systems; Predictive analytics; Sensor data; Industrial energy efficiency
Hill-Valley Yes No No Classification; Feature representation; Graph data; Pattern recognition
International Stiction Data Base Yes Yes No Valve stiction; Control loop analysis; Industrial process optimization; Fault diagnosis
Individual Household Electric Power Consumption Yes Yes No Electric power consumption; Time-series data; Energy monitoring; Smart grid; Household energy use
Industrial Safety And Health Analytics Database Yes No No Workplace safety; Health and safety; Accident reports; Risk management
Li Ion Battery Aging Datasets Yes Yes No Battery health; Prognostics; Electrochemical impedance spectroscopy; Deep discharge; Aging effects
Maintenance Of Naval Propulsion Plants Dataset Yes No Yes Naval propulsion; Predictive maintenance; Operational efficiency; System performance; Maritime data
Manufacturing Cost Yes No No Cost analysis; Production efficiency; Cost reduction; Economic analysis; Operational costs
Manufacturing Defects Yes No No Manufacturing defects; Quality control; Predictive modeling; Industrial analysis; Process optimization
Mechanical Analysis Yes No No Fault diagnosis; Electromechanical devices; Component analysis; Classification tasks; Pump analysis
Mercedes Benz Greener Manufacturing Yes No No Automotive manufacturing; Efficiency optimization; Environmental sustainability; Predictive modeling
Milling Wear Yes Yes No Milling operations; Tool wear analysis; Degradation study; Predictive maintenance; Operational efficiency
Multi Stage Continuous Flow Manufacturing Process Yes Yes No Manufacturing process; Continuous-flow; Sensor data; Process monitoring; Anomaly detection
Nasa Bearing Dataset Yes Yes No Bearing failure; Vibration analysis; Predictive maintenance; Mechanical diagnostics; Operational monitoring
Oecd Data Crude Oil Production Yes Yes No Crude oil; OECD countries; Energy production; Economic analysis; Market trends
Oil Storage Tanks Yes Yes No Satellite imagery; Oil storage tanks; Object detection; Remote sensing; Geospatial analysis
Oil Well Yes No No Oil well; Petroleum engineering; Production optimization; Operational data
Oil And Gas Yes No No Oil industry; Gas industry; Energy sector; Economic analysis; Market trends
Oscillation Detection Artificial Dataset Yes Yes Yes Control loops; Oscillation detection; Machine learning; Process optimization
Phm 2008 Challenge Yes Yes No Aircraft engines; Prognostics challenge; Sensor noise; Operational settings; Engine degradation
Phm Data Challenge Yes Yes No PHM; Fault detection; Prognostics; Industrial monitoring; Time-series analysis; Plant monitoring
Panasonic 18650Pf Li Ion Battery Data Yes Yes No Li-ion battery; State of charge; Battery testing; Energy storage; Neural networks
Parts Manufacturing Yes No No Manufacturing; Industrial data; Quality control; Component measurements; Production optimization
Power Consumption Of Tetuan City Yes Yes No Power consumption; Urban energy use; Time-series data; Weather data; Smart grid
Predicting Manufacturing Defects Dataset Yes No No Manufacturing defects; Quality control; Predictive modeling; Industrial data; Process optimization
Productivity Prediction Of Garment Employees Yes No No Garment industry; Employee productivity; Manufacturing process; Workforce analytics; Performance prediction
Prognostics Data Repository Yes Yes No Prognostics; NASA; Fault detection; Time-series data; System health monitoring
Pump Sensor Data Yes Yes No Sensor data; Pump monitoring; Predictive maintenance; Operational efficiency; Machine learning
Quality Prediction In A Mining Process Yes Yes No Mining industry; Process optimization; Quality control; Predictive analytics; Operational efficiency
Renewable Power Plants Yes No No Renewable energy; Power plants; Solar energy; Wind energy; Hydroelectric power
Robot Execution Failures Yes Yes No Robotics; Failure detection; Force and torque data; Time-series analysis; Machine learning
Sacac INA INA INA Control systems; Performance monitoring; Industrial data; Research; Process optimization
Secom Yes No No Manufacturing; Semi-conductor; Process optimization; Feature selection; Industrial data
Siso Raw No Yes No Control loops; Process monitoring; Oil and gas industry; Data visualization; Process control
Sml2010 Yes Yes No Domotic systems; Environmental monitoring; Home automation; Time-series data; Energy efficiency
Secure Water Treatment Swat Dataset Yes Yes Yes Water treatment; Cybersecurity; Anomaly detection; Sensor data; Time-series data
Severstal Steel Defect Detection Yes No No Steel defects; Surface defects; Industrial quality control; Image classification; Machine learning
Solar Power Generation Data Yes Yes No Solar energy; Power generation data; Environmental factors; Energy forecasting; Renewable energy
Steel Dataset Yes No No Steel manufacturing; Defect detection; Quality control; Industrial inspection; Manufacturing processes
Steel Industry Energy Consumption No Yes No Energy consumption; Steel and iron production; Electricity usage data; CO2 emissions; Korea Electric Power Corporation
Steel Industry Datasets Yes Yes No Steel production; Energy consumption; Manufacturing process; Industrial data; Operational efficiency
Steel Plates Faults Yes No No Steel plates; Fault detection; Manufacturing; Pattern recognition; Classification tasks
Superconductivity Data Yes No No Superconductors; Material properties; Physics; Chemistry; Critical temperature
Reference Energy Disaggregation Data Set Redd Yes Yes No Energy consumption; Residential data; High frequency data; Low frequency data; Circuit level monitoring
Top Defense Manufacturers Yes No No Defense industry; Manufacturer rankings; Market analysis; Financial data; Global defense companies
Turbofan Engine Degradation Simulation Data Set Yes Yes Yes Turbofan engines; Engine degradation; Simulation data; Prognostics health management; NASA dataset
Us Crude Oil Imports Yes Yes No Crude oil; Imports; U.S. economy; Energy economics; Market analysis
Uk Dale Dataset Yes Yes No Energy consumption; Smart homes; High-frequency data; Time-series analysis; Appliance monitoring
Urban Land Cover Yes No No Urban land cover; Aerial imagery; Environmental monitoring; Remote sensing; Land use classification
Vehicle Manufacturing Dataset Yes No No Vehicle production; Manufacturing quality; Operational efficiency; Automotive industry; Product testing
Versatile Production System Yes Yes INA Industrial automation; Sensor data; Production systems; Quality control; Manufacturing processes
Water Distribution Wadi Dataset Yes Yes Yes Water distribution; Cybersecurity; Anomaly detection; Sensor data; Time-series data
Wind Turbine Scada Dataset Yes Yes No Wind energy; SCADA; Operational data; Predictive maintenance; Energy efficiency
Wine Quality Yes No No Wine quality; Physicochemical analysis; Sensory data; Classification tasks; Regression tasks
Iv2V And Iv2I Plus Industrial Datasets Yes Yes No Industrial communication; V2V; V2I; Wireless networks; Signal processing; AGVs; Time-series

Contribution Guidelines

Thank you for considering contributing to our repository.

How You Can Contribute

You can contribute in several ways:

  • Suggest a New Dataset: Propose a new dataset by creating an issue under the "Enhancement" label in the Issues tab.
  • Add a Dataset: Create a JSON file describing a dataset and submit a pull request to add it to the repository.
  • Suggest Changes: You can suggest improvements through the Issues tab or directly edit the JSON files and submit your changes via a pull request.

Adding a Dataset

Before adding a new dataset, please ensure that it is unique and not already included in the repository.

To add a dataset:

  1. Create a JSON file that accurately describes the dataset, following the same template as the existing datasets in the json folder.
  2. Place this JSON file in the json folder.

Updating Documentation

To update the documentation (Markdown and HTML files) and refresh the README:

  1. Run the generate_documentation.py script located in the root of the repository. This script will:
    • Generate Markdown files in the markdown folder.
    • Generate HTML files in the html folder.
    • Update the README.md file with the latest datasets table.

Making a Pull Request

Please adhere to these guidelines when submitting a pull request:

  • Check for Duplicates: Ensure your contribution is unique and not already included.
  • Submit Separate Pull Requests: Submit individual pull requests for each suggestion or dataset.
  • Follow the format: Use our JSON template for datasets and maintain readability and structure in documentation.