1
00:00:00,610 --> 00:00:02,440
Okay, so we are in CloudWatch Alarms,

2
00:00:02,440 --> 00:00:04,540
so let's go and create an alarm.

3
00:00:04,540 --> 00:00:06,870
So first of all, I'm going to create an EC2 instance

4
00:00:06,870 --> 00:00:09,180
very quickly, and we're going to create

5
00:00:09,180 --> 00:00:11,430
an alarm on top of the CPU utilization.

6
00:00:11,430 --> 00:00:15,410
So we're going ahead and create an EC2 instance of t2 micro.

7
00:00:15,410 --> 00:00:17,920
I will go quickly to preview and launch and then say,

8
00:00:17,920 --> 00:00:19,380
yes, I have this.

9
00:00:19,380 --> 00:00:21,000
So we don't need to keep it or anything like this,

10
00:00:21,000 --> 00:00:22,700
we just want the instance to be launched.

11
00:00:22,700 --> 00:00:24,380
And the idea is that we want to create an alarm that

12
00:00:24,380 --> 00:00:28,470
will terminate the instance, if the CPU goes to 100%,

13
00:00:28,470 --> 00:00:30,340
so let's create this alarm.

14
00:00:30,340 --> 00:00:32,780
So we need to select a metric.

15
00:00:32,780 --> 00:00:35,300
And so for this, we need to choose a metric.

16
00:00:35,300 --> 00:00:38,050
So we need to go and find our EC2 instance.

17
00:00:38,050 --> 00:00:42,120
So this is our instance ID, right here, and search for it.

18
00:00:42,120 --> 00:00:44,800
And I'm maybe a little bit too quick, okay.

19
00:00:44,800 --> 00:00:47,410
So let's wait for the instance to be launched.

20
00:00:47,410 --> 00:00:50,610
I will go into EC2 per instance metric,

21
00:00:50,610 --> 00:00:54,540
and then I will wait for it to be populating.

22
00:00:54,540 --> 00:00:57,280
So it took about five minutes for some metrics to appear

23
00:00:57,280 --> 00:00:59,160
in that cloud dashboard for my instance.

24
00:00:59,160 --> 00:01:03,090
So now I can probably refresh this page and I will get the

25
00:01:03,090 --> 00:01:04,790
chance to find the metrics I'm looking for.

26
00:01:04,790 --> 00:01:07,960
So let me select a metric and then paste the instance

27
00:01:07,960 --> 00:01:09,610
in perfect, I have it,

28
00:01:09,610 --> 00:01:14,300
and I will go and find the CPU utilization of my instance.

29
00:01:14,300 --> 00:01:16,110
So this is this metric right here.

30
00:01:16,110 --> 00:01:18,080
Okay, well, select this metric.

31
00:01:18,080 --> 00:01:19,030
As you can see, we have

32
00:01:19,030 --> 00:01:22,190
the eight here and then we can choose a way to

33
00:01:22,190 --> 00:01:25,530
compute this metric, so average, sum, maximum, and so on.

34
00:01:25,530 --> 00:01:28,560
The period we want to evaluate this alarm on.

35
00:01:28,560 --> 00:01:32,100
So five minutes is good because this metric gets populated

36
00:01:32,100 --> 00:01:35,240
every five minutes, if we don't enable detailed monitoring.

37
00:01:35,240 --> 00:01:37,370
Now we get some conditions in terms of the threshold.

38
00:01:37,370 --> 00:01:40,710
So is it Static or is it an Anomaly detection?

39
00:01:40,710 --> 00:01:43,610
Is it a greater than, equal than, and so on.

40
00:01:43,610 --> 00:01:44,530
So I will say for example,

41
00:01:44,530 --> 00:01:48,040
if you're a greater than 95%

42
00:01:48,040 --> 00:01:50,040
for a long time, so for,

43
00:01:50,040 --> 00:01:53,120
and here you can say, three out of three.

44
00:01:53,120 --> 00:01:57,770
So that means that for 15 minutes, you're stuck at 95%,

45
00:01:57,770 --> 00:02:01,160
then probably something is up with this machine.

46
00:02:01,160 --> 00:02:03,810
And so in that case, I could choose a notification.

47
00:02:03,810 --> 00:02:05,580
I could choose a Auto Scaling action.

48
00:02:05,580 --> 00:02:07,930
I could choose an EC2 action, or in a systems manager

49
00:02:07,930 --> 00:02:10,389
action, but I'm going to choose an EC2 action.

50
00:02:10,389 --> 00:02:14,330
Okay, and it will say, hey, if you are in alarm, okay,

51
00:02:14,330 --> 00:02:16,970
then just terminate this instance because maybe I know that

52
00:02:16,970 --> 00:02:20,930
my application sometimes has a huge failure and the CPU

53
00:02:20,930 --> 00:02:23,880
installation will be at a 95% or a 100%

54
00:02:23,880 --> 00:02:25,930
for a very long time, And the only way to resolve this

55
00:02:25,930 --> 00:02:27,720
is to just terminate the instance.

56
00:02:27,720 --> 00:02:31,120
So I will choose this, and then click on next,

57
00:02:31,120 --> 00:02:32,033
and say terminate,

58
00:02:33,950 --> 00:02:35,813
EC2 on high CPU.

59
00:02:38,260 --> 00:02:41,310
Click on next to verify everything, and we're good to go.

60
00:02:41,310 --> 00:02:44,170
So now this alarm obviously does have insufficient data,

61
00:02:44,170 --> 00:02:46,090
so we need to wait 15 minutes for it to be okay,

62
00:02:46,090 --> 00:02:49,910
and it's not going to be triggered unless we make it so.

63
00:02:49,910 --> 00:02:52,610
So, we could go into the EC2 instance and launch a way

64
00:02:52,610 --> 00:02:55,420
to get the CPU very high for 15 minutes,

65
00:02:55,420 --> 00:02:57,720
but this would be a very, very long,

66
00:02:57,720 --> 00:02:59,890
or we can use the API call name,

67
00:02:59,890 --> 00:03:04,180
Set alarm state, to really see what would happen if this

68
00:03:04,180 --> 00:03:07,190
alarm went into the breach phase.

69
00:03:07,190 --> 00:03:08,640
So let's have a look.

70
00:03:08,640 --> 00:03:10,460
This is the history of the alarm, okay.

71
00:03:10,460 --> 00:03:12,884
And what I'm going to do is that I'm going to set the alarm

72
00:03:12,884 --> 00:03:16,163
to say, so I type AWS CloudWatch, set alarm states,

73
00:03:18,250 --> 00:03:20,280
and we'll look at the API reference.

74
00:03:20,280 --> 00:03:23,330
And so we need to do set alarm state the alarm name and the

75
00:03:23,330 --> 00:03:26,380
state value and the state reason.

76
00:03:26,380 --> 00:03:29,920
So we'll do, in here.

77
00:03:29,920 --> 00:03:34,430
So AWS CloudWatch set alarm states,

78
00:03:34,430 --> 00:03:37,080
and then we need to set multiple parameters.

79
00:03:37,080 --> 00:03:41,993
So the alarm name is going to be this one,

80
00:03:45,180 --> 00:03:47,890
then the alarm,

81
00:03:47,890 --> 00:03:50,460
so the states value

82
00:03:52,720 --> 00:03:54,700
is going to be alarm.

83
00:03:54,700 --> 00:03:58,390
And the state reason, is going to be testing.

84
00:03:58,390 --> 00:04:01,160
We press enter, and now this alarm,

85
00:04:01,160 --> 00:04:05,890
if we refer to this page is now in the alarm state,

86
00:04:05,890 --> 00:04:07,670
as you can see, it says in alarm.

87
00:04:07,670 --> 00:04:10,830
And so the action is, when in alarm terminate the instance.

88
00:04:10,830 --> 00:04:12,280
And so if you look at the history,

89
00:04:12,280 --> 00:04:15,310
it says that the alarm updated from OK, to in alarm and

90
00:04:15,310 --> 00:04:18,519
then an action was made and it's, successfully

91
00:04:18,519 --> 00:04:21,600
executed the action to terminate my EC2 instance.

92
00:04:21,600 --> 00:04:24,090
So if I go into my EC2 instances here, and I refresh,

93
00:04:24,090 --> 00:04:26,650
as we can see is shutting down and is being terminated

94
00:04:26,650 --> 00:04:29,470
because while there was an alarm that was triggered on top

95
00:04:29,470 --> 00:04:30,660
of this EC2 instance,

96
00:04:30,660 --> 00:04:33,720
and we did set up the alarm to do this specific action.

97
00:04:33,720 --> 00:04:34,980
So that's it, I hope you liked it.

98
00:04:34,980 --> 00:04:36,470
I hope this makes sense to you.

99
00:04:36,470 --> 00:04:38,420
And I will see you in the next lecture.