Websites Monitoring CLI Application

An interactive shell that allows users to monitor websites' performance, gathering statistics about average response time, maximum responses time, availability and HTTP status codes count.

Features

Monitor websites using an interactive shell
Observe the statistics for all the websites you add in a single updating terminal window
Send GET requests to a website over custom defined frequency
Obtain useful statistics such as average response time, maximum responses time, availability of a given website and HTTP status codes count
Get alert messages when website is down and when it has recovered

Tech stack

TypeScript
Node.js
NestJS - progressive Node.js framework
Vorpal - uninterrupted user input
Winston - logger
Boxen & Table - console output formatting
Moment.js - working with time
Jest - unit testing
ESLint - code analysis
Prettier - code formatting

Installation

Prerequisites

Node.js - version 16.13.0 is recommended, at least 12.22.7 is required
Git - not required if you download project as a zip file

Setup

Install the application with Git:

$ git clone https://github.com/kamranpoladov/websites-monitoring.git
$ cd websites-monitoring

Copy the contents of .env.dist file into .env file located in the root of the project directory either manually or with one of the commands below:

Linux or macOS:

$ cp .env.dist .env

Windows:

$ copy .env.dist .env

Feel free to adjust the configurations inside .env following the guidelines to change intervals, durations, etc.

Install the dependencies with npm:

$ npm install

Build the project:

$ npm run build

Usage

Disclaimer

It is strongly recommended using the application in full-screen mode since the statistics for all the websites added are displayed in a single window. Because of the non-responsive nature of terminal, changing the width and height parameters may lead to disturbed output.

Starting the application

$ npm start

Keeping track of validation errors

In order to keep track of input validation errors, you can also run the following command in a separate window:

$ npm run start:errors

However, this is completely optional as all the validation errors are going to be saved in errors.log file which you can also review manually. Note that you should not run this script before starting the application with npm start though.

Starting the application in DEV mode

You can also alternatively start application in dev mode in order to skip npm run build stage every time code is changed using :dev suffix:

$ npm run start:dev
$ npm run start:errors:dev

After starting the application

An interactive shell will appear in your command line (wait to see >> symbol)

Start using this shell to monitor different websites

Available commands

monitor <website> <interval> - adds a new website (has to be a valid, unique in the scope of application URL, adding protocol is optional but required for HTTPS connections) to be monitored over a specified interval (number in seconds, minimum is 3)
help - lists all available commands
exit - exits the application (this will not close any of the windows displaying stats for the websites you have already added)

Examples

$ npm start
>> monitor datadoghq.com 5

>> monitor invalid_website 6

   ┏ Validation failed ━┓
   ┃                    ┃
   ┃   Url is invalid   ┃
   ┃                    ┃
   ┗━━━━━━━━━━━━━━━━━━━━┛

>> monitor localhost:3000 2

   ┏━━━━━━━━━━━━━━━━━━━━━━━━━ Validation failed ━━━━━━━━━━━━━━━━━━━━━━━━━┓
   ┃                                                                     ┃
   ┃   Interval has to be a positive number greater than or equal to 3   ┃
   ┃                                                                     ┃
   ┗━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┛

Interface

Interface represents a table with all the websites added during program uptime. "Type" column represents whether stats are for "Short" (10 minutes) or "Long" (1 hour) intervals. Stats are updated automatically as program is running. You can add more websites to be monitored using the commands above while viewing this table. However, please note that the height of this table is limited to the size of your terminal window. Therefore, it is not recommended adding too many websites in a single instance of program (more details on "why" in further thoughts)

╔═══════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════╗
║                                                                        STATS FOR ALL WEBSITES                                                                         ║
╟───────────────────────┬──────┬──────────┬──────────┬──────────────┬───────────────────┬───────────────────┬─────────────┬─────────────────────────────────────────────╢
║        Website        │ Type │   From   │    To    │ Availability │ Avg response time │ Max response time │ Http status │                   Alerts                    ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║ https://datadoghq.com │  S   │ 00:41:39 │ 00:51:38 │     100%     │      233 ms       │      515 ms       │ 200 => 200  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:34:47 │ 00:50:47 │     100%     │      237 ms       │      998 ms       │ 200 => 320  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║  https://google.com   │  S   │ 00:41:44 │ 00:51:44 │     100%     │      194 ms       │      549 ms       │ 200 => 120  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:34:54 │ 00:50:54 │     100%     │      193 ms       │      549 ms       │ 200 => 192  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║ https://facebook.com  │  S   │ 00:41:44 │ 00:51:41 │     100%     │      343 ms       │      510 ms       │ 200 => 100  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:35:01 │ 00:51:01 │     100%     │      345 ms       │      727 ms       │ 200 => 160  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║  https://youtube.com  │  S   │ 00:41:45 │ 00:51:42 │     100%     │      476 ms       │      1132 ms      │ 200 => 150  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:35:12 │ 00:51:12 │     100%     │      470 ms       │      1372 ms      │ 200 => 240  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║ http://localhost:3000 │  S   │ 00:41:42 │ 00:51:42 │     57%      │       2 ms        │       15 ms       │  200 => 55  │ Went down with availability 0% at 00:35:22  ║
║                       │      │          │          │              │                   │                   │  201 => 14  │ Recovered with availability 83% at 00:43:27 ║
║                       │      │          │          │              │                   │                   │  404 => 14  │ Went down with availability 75% at 00:45:52 ║
║                       │      │          │          │              │                   │                   │  501 => 15  │                                             ║
║                       │      │          │          │              │                   │                   │  502 => 22  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:35:22 │ 00:51:22 │     35%      │       2 ms        │       15 ms       │  200 => 53  │                                             ║
║                       │      │          │          │              │                   │                   │  201 => 14  │                                             ║
║                       │      │          │          │              │                   │                   │  404 => 13  │                                             ║
║                       │      │          │          │              │                   │                   │  501 => 14  │                                             ║
║                       │      │          │          │              │                   │                   │  502 => 98  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║ http://localhost:3001 │  S   │ 00:41:43 │ 00:51:41 │     57%      │       2 ms        │       15 ms       │  200 => 85  │ Went down with availability 0% at 00:35:31  ║
║                       │      │          │          │              │                   │                   │  502 => 65  │ Recovered with availability 83% at 00:47:39 ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:35:31 │ 00:51:31 │     34%      │       2 ms        │       17 ms       │  200 => 82  │                                             ║
║                       │      │          │          │              │                   │                   │ 502 => 158  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║   https://apple.com   │  S   │ 00:41:37 │ 00:51:37 │     100%     │      171 ms       │      205 ms       │  200 => 60  │                                             ║
╟───────────────────────┼──────┼──────────┼──────────┼──────────────┼───────────────────┼───────────────────┼─────────────┼─────────────────────────────────────────────╢
║                       │  L   │ 00:35:57 │ 00:50:57 │     100%     │      169 ms       │      258 ms       │  200 => 90  │                                             ║
╚═══════════════════════╧══════╧══════════╧══════════╧══════════════╧═══════════════════╧═══════════════════╧═════════════╧═════════════════════════════════════════════╝

>> monitor anotherwebsite.com 4

Code examples

"Any fool can write code that a computer can understand. Good programmers write code that humans can understand" - Martin Fowler

Regardless of this famous quote, the implementations of the most important features of the application along with explanations can be found in this section

Periodic website fetching and statistics displaying

SchedulerService is used to set up the jobs to be run periodically. It is based on ScheduleModule of @nestjs/schedule library but provides additional functionality such as optionally firing off the job function immediately and two types of callbacks to be run only once after or before a job starts

src/Providers/scheduler/scheduler.service.ts

public async addJob({
  name,
  period,
  job,
  afterStart,
  executeImmediately,
  beforeStart
}: AddJobDto) {
  await beforeStart?.(); // execute beforeStart callback if it was provided

  await this.executeAndScheduleJob({
    name,
    job,
    period,
    executeImmediately
  });

  afterStart?.(); // execute afterStart callback if it was provided
}

private async executeAndScheduleJob({
  name,
  job,
  period,
  executeImmediately
}: Omit<AddJobDto, 'callback'>): Promise<void> {
  if (executeImmediately) await job();
  const interval = setInterval(job, period.asMilliseconds());
  this.schedulerRegistry.addInterval(name, interval);
}

StatsService consumes SchedulerService to set up periodic fetching of a website and logging of statistics for both short (every 10 seconds) and long (every minute) intervals. Note that the job for displaying the short stats starts only after the website has been fetched for the first time. This is done to handle the edge case of the very first request giving timeout error: if it takes a website too long to respond (i.e. longer than 10 seconds) for the first time, the statistics should not be displayed as we have no responses to analyze yet. Also, the displaying of long stats is delayed since for the first 10 minutes they are going to be the same as short stats.

Method below listens to the events emitted each time a new website is added (see src/Modules/shell/shell.service.ts)

src/Modules/stats/stats.service.ts

@OnEvent(MonitorWebsiteEvent.eventName)
public async monitor({ website, interval }: MonitorWebsiteEvent) {
  const startStatsShortJob = () =>
    this.schedulerService.addJob({
      name: `${website}#${DISPLAY_STATS_SHORT_JOB_KEY}`,
      period: this.appConfigService.shortInterval,
      job: async () =>
        this.updateStats(
          website,
          this.appConfigService.shortDuration,
          StatsType.Short
        ),
      executeImmediately: true
    });

  const startStatsLongJob = () =>
    this.schedulerService.addJob({
      name: `${website}#${DISPLAY_STATS_LONG_JOB_KEY}`,
      period: this.appConfigService.longInterval,
      job: async () =>
        this.updateStats(
          website,
          this.appConfigService.longDuration,
          StatsType.Long
        ),
      executeImmediately: true
    });

  await this.schedulerService.addJob({
    name: `${website}#${FETCH_JOB_KEY}`,
    period: interval,
    job: () => this.fetchWebsite(website),
    afterStart: async () => {
      startStatsShortJob();
      setTimeout(
        () => startStatsLongJob(),
        this.appConfigService.shortDuration.asMilliseconds()
      );
    },
    beforeStart: async () =>
      this.statsRepository.initializeEmptyStats(website), // create new stats instance before fetching every website
    executeImmediately: true
  });
}

Alerting logic

Alerts are implemented using EventEmitter to avoid circular dependency between AlertService and ResponseService

ResponseService emits an event every time a new response is registered

src/Modules/response/response.service.ts

public async registerResponse(url: string): Promise<void> {
  const httpResponse = await this.httpService.fetch(url);

  const response = plainToClass(ResponseModel, {
    ...httpResponse,
    registeredAt: moment().toISOString()
  });

  this.responseRepository.addResponse(response);

  // notify AlertService about new registered response
  await this.eventEmitter.emitAsync(
    RegisterResponseEvent.eventName,
    new RegisterResponseEvent(url)
  );

  return response;
}

AlertService listens to this event and consumes ResponseService to determine whether website is down or has recovered and adds according AlertModel entity to the StatsRepository. Therefore, all alerts are stored per each website in a unified storage inside StatsRepository

src/Modules/alert/alert.service.ts

@OnEvent(RegisterResponseEvent.eventName, { async: true, nextTick: true })
public onRegisterResponse({ website }: RegisterResponseEvent): void {
  const start = moment().subtract(this.appConfigService.downCheckDuration);
  const now = moment();
  const interval = new Interval({ start, end: now });

  const availability = this.responseService.getAvailability(
    website,
    interval
  );
  const targetAvailability = this.appConfigService.alertAvailability;
  const currentlyDown = this.alertRepository.getIsDown(website);

  this.alertRepository.setIsDown(website, availability < targetAvailability);

  if (availability < targetAvailability && !currentlyDown) {
    this.statsRepository.addAlertForWebsite(website, {
      time: now,
      availability,
      type: AlertType.Down
    });
  } else if (availability >= targetAvailability && currentlyDown) {
    this.statsRepository.addAlertForWebsite(website, {
      time: now,
      availability,
      type: AlertType.Recover
    });
  }
}

Optimization

Fetched responses are being stored in-memory inside an array. To reduce the memory usage, cron job that runs every 30 minutes is implemented to delete the old responses from memory. A response is considered old if it was registered more than one hour ago (the time of displaying long statistics)

src/Modules/response/response.service.ts

@Cron(CronExpression.EVERY_30_MINUTES)
private deleteOldResponses() {
  const time = moment().subtract(this.configService.longDuration);

  this.responseRepository.clearBeforeTime(time);
}

Stats are calculated "on the go", analyzing the in-memory responses over certain interval. Since responses are always sorted by their registration time (because a new response is always pushed to the end of array), binary search is used to quickly find all responses over a certain interval

src/Modules/response/response.repository.ts

public getResponsesForWebsitePerInterval(
  website: string,
  interval: Interval
): ResponseModel[] {
  return this.getResponsesForInterval(interval).filter(
    response => response.website === website
  );
}

private getResponsesForInterval(interval: Interval): ResponseModel[] {
  const startIndex = this.getClosestResponseIndexToTime(interval.start);
  const endIndex = this.getClosestResponseIndexToTime(interval.end);

  return this.responses.slice(startIndex, endIndex);
}

// since responses are always sorted by registeredAt, use binary search for quicker find operations
private getClosestResponseIndexToTime(time: Moment): number {
  let [start, end] = [0, this.responses.length];

  while (start < end) {
    const mid = Math.floor((start + end) / 2);
    if (this.responses[mid].registeredAt === time) {
      return mid;
    } else if (this.responses[mid].registeredAt < time) {
      start = mid + 1;
    } else {
      end = mid - 1;
    }
  }

  return start;
}

Testing

Unit tests are implemented to test alerting mechanism.

alert-listener.spec.ts ensures that events are correctly emitted and listened to in order to trigger alert

alert-logic.spec.ts ensures that AlertService correctly determines the type of alert to be triggered and whether it should be triggered at all

Scripts

npm run test - run all tests
npm run test:types - run the test for TypeScript types compatability
npm run test:lint - run ESLint code analysis test
npm run fix:lint - fix ESLint errors
npm run test:prettier - run Prettier code formatting test
npm run fix:prettier - fix Prettier errors

Further thoughts

The application is refreshing data inside terminal window every time stats for any of the websites should update. Even though user input is being preserved between inputs, the outputs for invalid inputs will be cleared as soon as window is refreshed. This rather happens because of the nature of Node.js console. I solved this issue by giving an option of running additional script to keep track on errors in a separate terminal window npm run start:errors (more details here). Therefore, all the validation errors are displayed in a separate window and stored in a errors.log file for as long as the program is running.

Additionally, as mentioned in interface section, it is not recommended adding too many websites per single program instance (i.e. more than your terminal viewport can handle). Doing so may result into disturbed display of the statistics that is left outside your terminal viewport. Console is being cleared every time stats need to update for any of the websites and new, updated stats are being drawn, therefore, it is very inconvenient to scroll to the stats that are outside the viewport. Nevertheless, all of these stats are being preserved and user can easily see them by scrolling after shutting down the application.

Because of the issues described above, perhaps, web application would be a better solution for such problem. It is way easier to selectively update information on the web page using a library like React and instead of clearing the whole page every time stats need to update, it is better to send server-sent events (SSE) and make React client-side listen to them and update stats accordingly.

One of the possible design improvements would be to separate the storage of alerts and statistics and "join" them with some unique common key (e.g. website). In SQL database context, for example, we would "left join" alerts inside StatsRepository in order to obtain full stats per website, instead of keeping alerts and stats together.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
src		src
tests/alert		tests/alert
.env.dist		.env.dist
.eslintignore		.eslintignore
.eslintrc		.eslintrc
.gitignore		.gitignore
.prettierrc		.prettierrc
README.md		README.md
nest-cli.json		nest-cli.json
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Websites Monitoring CLI Application

Table of contents

Features

Tech stack

Installation

Prerequisites

Setup

Usage

Disclaimer

Starting the application

Keeping track of validation errors

Starting the application in DEV mode

After starting the application

Available commands

Examples

Interface

Code examples

"Any fool can write code that a computer can understand. Good programmers write code that humans can understand" - Martin Fowler

Periodic website fetching and statistics displaying

Alerting logic

Optimization

Testing

Scripts

Further thoughts

About

Releases

Packages

Languages

kamranpoladov/websites-monitoring

Folders and files

Latest commit

History

Repository files navigation

Websites Monitoring CLI Application

Table of contents

Features

Tech stack

Installation

Prerequisites

Setup

Usage

Disclaimer

Starting the application

Keeping track of validation errors

Starting the application in DEV mode

After starting the application

Available commands

Examples

Interface

Code examples

"Any fool can write code that a computer can understand. Good programmers write code that humans can understand" - Martin Fowler

Periodic website fetching and statistics displaying

Alerting logic

Optimization

Testing

Scripts

Further thoughts

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages