
Generating a large dataset by fetching data from an API and exporting it to a text file.

FizzyMo/BeyondBackend.Large-Dataset-Generation


Large Dataset Generation

This repository provides a Node.js script that generates a large dataset from the News API. The script fetches articles from the API, formats them, and writes them to a text file.

Introduction

This project demonstrates how to generate a large dataset by fetching data from an API and exporting it to a text file. It uses the News API to gather articles about a specific topic and store the results in a .txt file.

API Details

The script uses the News API to fetch the latest news articles. The request details are:

  • Endpoint: https://newsapi.org/v2/everything
  • Parameters:
    • q: The query keyword (e.g., Elon Musk)
    • language: The language of the articles (e.g., en for English)
    • sortBy: The sorting criteria (e.g., publishedAt for the most recent articles)
    • apiKey: Your News API key
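As a minimal sketch of how these parameters combine into a request URL, the snippet below uses Node's built-in URL and URLSearchParams classes. The helper name buildNewsUrl is hypothetical and not taken from the repository's index.js; the parameter values are the examples listed above.

```javascript
// Hypothetical helper: builds the request URL from the parameters listed above.
const NEWS_API_ENDPOINT = "https://newsapi.org/v2/everything";

function buildNewsUrl(query, apiKey) {
  const url = new URL(NEWS_API_ENDPOINT);
  // URLSearchParams takes care of encoding spaces and special characters.
  url.search = new URLSearchParams({
    q: query,               // query keyword
    language: "en",         // English articles
    sortBy: "publishedAt",  // most recent first
    apiKey,                 // your News API key
  }).toString();
  return url.toString();
}

console.log(buildNewsUrl("Elon Musk", "your_news_api_key_here"));
```

Building the query string this way avoids hand-escaping keywords that contain spaces, such as Elon Musk.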

Script Explanation

Fetching Data

The script uses the axios library to fetch data from the News API.
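The fetch step might look roughly like the sketch below. It is not the repository's actual code: the function name fetchArticles and the optional client argument are assumptions made for illustration. It relies on axios.get accepting a params option and on the News API returning its results under an articles key.

```javascript
// Hypothetical helper; the repository's index.js may be organised differently.
// `client` defaults to axios but can be replaced with a stub for testing.
async function fetchArticles(query, apiKey, client = require("axios")) {
  const response = await client.get("https://newsapi.org/v2/everything", {
    params: { q: query, language: "en", sortBy: "publishedAt", apiKey },
  });
  // axios exposes the parsed JSON body on `response.data`; the News API
  // places the list of articles under the `articles` key.
  return response.data.articles || [];
}

// Example usage (performs a real network request, so it is left commented out):
// fetchArticles("Elon Musk", process.env.NEWS_API_KEY)
//   .then((articles) => console.log(`Fetched ${articles.length} articles`))
//   .catch((err) => console.error("Request failed:", err.message));
```

Passing the parameters through the params option lets axios handle URL encoding, and returning an empty array when articles is absent keeps later formatting code simple.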

Writing Data to File

The fetched data is formatted and written to a .txt file using the fs (File System) module:
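A minimal sketch of the formatting and writing step follows. The field names (title, source.name, publishedAt, url) match the News API article object, but the layout of each entry and the helper names formatArticle and writeArticles are assumptions, since the original formatting code is not reproduced here.

```javascript
const fs = require("fs");

// Hypothetical formatter: turns one article object into a block of text.
function formatArticle(article, index) {
  return [
    `Article ${index + 1}`,
    `Title: ${article.title}`,
    `Source: ${article.source ? article.source.name : "unknown"}`,
    `Published: ${article.publishedAt}`,
    `URL: ${article.url}`,
  ].join("\n");
}

// Writes all articles to a text file, separated by blank lines.
function writeArticles(articles, filePath = "articles.txt") {
  const text = articles.map(formatArticle).join("\n\n") + "\n";
  fs.writeFileSync(filePath, text, "utf8");
  return text;
}
```

For very large datasets, a write stream (fs.createWriteStream) may be a better fit than building the whole string in memory first.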

Running the Script

To run the script and generate your dataset, use the following command: node index.js

Example Output

After running the script, you will find articles.txt in the root directory, containing the formatted articles.

Prerequisites

Before you begin, ensure you have Node.js and npm installed, and that you have a News API key.

Setup

Cloning the Repository

  1. Clone this repository to your local machine, then change into the project directory:
     git clone https://github.com/yourusername/large-dataset-generation.git
     cd large-dataset-generation

Installing Dependencies

  1. Install the required Node.js packages: npm install axios dotenv

Environment Variables

  1. Create a .env file in the root directory of the project and add your News API key:
    • If you have not already installed dotenv in the previous step, run: npm i dotenv
    • Add the following line to the .env file: NEWS_API_KEY=your_news_api_key_here
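Inside the script, the key can then be read from process.env. The sketch below is one way to do this, not necessarily how index.js does it: the getApiKey helper is hypothetical, and the try/catch fallback is an added convenience so the snippet still runs when dotenv is not installed.

```javascript
// Load .env into process.env. Assumes the dotenv package is installed;
// if it is not, fall back to variables already set in the shell.
try {
  require("dotenv").config();
} catch (err) {
  console.warn("dotenv not available; using existing environment variables");
}

// Hypothetical helper: fail early with a clear message if the key is missing
// or still set to the placeholder value.
function getApiKey(env = process.env) {
  const key = env.NEWS_API_KEY;
  if (!key || key === "your_news_api_key_here") {
    throw new Error("NEWS_API_KEY is not set; add it to your .env file");
  }
  return key;
}
```

Keeping the key in .env rather than in the source keeps it out of version control, provided .env is listed in .gitignore.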

Technologies

This project uses the following technologies:

  • Replit: For online coding and development
  • Node.js: JavaScript runtime environment
  • JavaScript: Programming language used for the script
  • axios: Promise-based HTTP client for making API requests

Project Structure

large-dataset-generation
├── index.js
├── README.md
├── articles.txt
├── .env
├── images
│   ├── example.output.png
│   ├── fetch.data.png
│   └── writing.data.png
└── video
    └── generate_data.gif

Author

Carisa Saenz-Videtto

Contact

carisasaenz@gmail.com
