I have no idea what's going on in my
network. What are my servers doing? My routers, my switches, my storage,
my AI server with the crazy GPUs. Are they okay? Are they
up? Are things overheating? That kind of stuff keeps me
up at night. What about you? Do you know what your stuff is doing?
Is it okay? If you don't, that's fine. In this video, we're going to change that for me too
because I'm finally going to deploy an IT monitoring system so I can stop being
so stressed and you're going to do the same thing. And this tool I'm
going to show you is just awesome. We can monitor pretty much everything.
Networks, logs, gaming, PCs, my toilet, everything and we get alerts and
dashboards. I love it so much. I'm geeking out over this fun fact. This was the very first IT monitoring
solution I've ever used back when I was on the help desk. It's called WhatsApp Gold
and yeah, that's kind of a weird name, but you'll never forget it. I still have nightmares of getting
networked outage emails from wg@companyname.com. But anyways, they are the sponsor of this video and
they just released their free version for home lab like you and me and dude,
they're not holding back on the features. When I say you can monitor anything,
I mean it. So get your coffee ready. We're going to dive deep into
the world of IT. Monitor. You're going to learn a ton and by the
end of it you're going to know exactly what your stuff is up to. Don't let
your IT stuff keep secrets from you. We're going to find that out and maybe
you'll figure out why that one thing isn't working. You want to know that,
right? Bob? Bob shakes head all. Let's go. Quick note. Progress as a sponsor.
What's up? Gold is their product. Just had to fix that. Alright,
back to the video. Now, before we jump into the install, let's talk a bit more about what it means
to monitor your systems and we'll also have to actually get your
stuff ready to be monitored. Now the most basic thing we're trying
to find out when we're monitoring our stuff, is it up? Is it actually alive? And the way an IT monitoring system will
do this and also the way we'll do this is through a little thing called
Ping. You've probably heard of it. We'll send an ICMP echo request
to our server or computer, which is us just saying,
Hey buddy, you awake? And then if we receive an ICMP
echo reply like this, it's up, it's awake and our IT monitoring system, which I'm just going to say what's up
gold now because I'm tired of saying it. Monitoring system. What's up gold. And actually I'm going to abbreviate
that further. Let's just call it wg. What's up? Gold wg. WG will actually do this and as long
as it keeps receiving echo replies from your systems, from your computer,
your server, router, whatever it is, it'll say it's up. But if it stops
receiving those things, boom, it puts that sucker in a downstate
and it's going to let you know. Now that's cool by itself, but what about the other stuff like
sure your stuff is responding to a ping request, but that doesn't mean it's
working. What about your hard drives? Are they full? Are you out of
space? What about temperature? Is your gaming PC overheating?
How's your network traffic? Maybe your network's slow
and you don't know why. Well, it's not being monitored. CPU
utilization, membrane utilization, there's so much more we have to know
about our stuff to make sure it's actually up and working and that's where things
like what's up Gold come in because they can monitor so much more than ping and
there's two protocols that help us do this and they're pretty awesome. So
answering more than just is it alive? We're going to be answering the
question, is it actually working? I'm going to circle these questions so
they look cooler and they stand out more. I'm not sure if that looks
cooler. Let me know. Coffee Break Network. Chuck Duck Coffee. So our
two protocols, one is called WMI. This is all about Windows.
It's very Windows specific.
It even has a fancy name. Windows management instrumentation.
I don't really like that, but it's what it's, we'll touch
more on that here in a minute. The other protocol is something
called SNMP. This is a big one. It stands for these simple
network management protocol
and if you've never heard of this before, it's kind of crazy. It seems like magic and it seems
like it shouldn't be there. You'll understand when I show you
here in a moment. And in fact, let's just bust open SNMP first because
I feel like it is the craziest thing. SNMP allows our monitoring systems to
connect to a device and collect stats about how it's doing and instead of
receiving a simple ICMP echo reply, we get a lot more like CPU utilization
stats on memory network discs. Now lemme show you real
quick how it works. I'm going to use SNMP to find the
CPU utilization of this server. Here we is found at 10
point 70.7 point 71. We'll talk more about this
command here in a bit, but what I want you to pay attention
to right now is this bit right here. What is this? That's how we ask for CPU
utilization. Everyone knows that, right? No, it looks ridiculous and it's something
you'll never have to know or memorize, but just know this number. It's something called an
OID or an object identifier. And if we go ahead and run this
command, we should find boom, the CP utilization of that server.
Let's try it again. Let's try it again. It's not changing. Or we might use this OID for disc
utilization or use disc space and it shows us a lot of information, so WG or our IT monitoring system while
use these OIGs to find out how your system is doing. Now it's important
to note an OID like this for CPU, it's generic. So weg knows that if it connects to
pretty much any system that has SNMP, which is pretty much everything and
knows that if it looks for that OID, it's going to get CPU
utilization across the board. Now what's really cool is that certain
systems can have more OIGs outside of what's generic and default. For
example, this one right here, this is Cisco specific for a Cisco
switch to find the temperature of that switch. And I point that out because you may have
stuff systems that have proprietary or special OIGs that your monitoring systems
may not be able to look for or may not even know about. That's why companies like Cisco will
have an entire pack of special OS to monitor specific things about them. And those packs are called IBS
or management information basis. I've used these so many times throughout
my career to monitor the important stuff I care about. And all you would do is download the
MIB from Cisco and upload that to your monitoring system to what's up gold and
then tell it to look for these specific os. Now thankfully, systems like WG often have a lot of the
popular vendors and my BS already loaded and once they identify that
that is a Cisco switch, they're going to look for those
specific os. It's very slick. Now, before we move on, just stop and think. SNMP is kind of crazy because most systems
come with it baked in to be able to monitor them and it exposes a ton
of information about your stuff, which is cool and kind of
dangerous. How do you secure that? We'll talk about that. I'll even show
you how to set up SNMP on a few devices. But before we do that, let's talk about how SNMP is
normally used by things like wg. Now, typically by default on
most systems including wg, SNMP will do a thing called polling,
which simply means, Hey server, tell me everything about you real quick.
Thanks. Got it? Just do a little poll, got a little clipboard, whatever. It'll do that roughly every 10 minutes
by default and also by default, most servers are running
SNP on UDP port 1 61. Now this is fine for most scenarios, but what about system critical
things like temperature? I don't want to wait 10 minutes to learn
that my server room has overheated and everything's dying. That's
something I want to know right now. Now we could increase the polling time
to be or decrease it to be something shorter, like every 30
seconds, pull everything. We don't have to do that though. SNP has a really cool function
that allows the system, so our server here to send what's
called an SNP trap. It's a trap. Sorry, I had to do that. This is something
we configure on this server. Let's say for example, temperature and we'll say if it reaches
a certain threshold, it's super hot. Then instead of waiting for
something to be pulled about this, you send an SNMP trap to
the monitoring server, which is also set up to receive
that. Now enough theory, lemme show you how to get SNP set
up here. I have a Linux server. I'll just check really quick to make
sure it's not running. SNMP. That is, no, it doesn't seem to be there. So
first one, install SNMP. Now again, a lot of systems already have
this there in case it doesn't. We'll do this pseudo a PT update to
update our repositories and then we'll do pseudo A PT install S-N-M-P-D. Now if I see what's running on my server
once more using SS with the dash toin switches and grabbing four port 1
61, I can see it is now running. Now a fun fact about
security from another server, I'm going to try and use SNMP to find
the CPU utilization on that server. Now, we didn't talk about this before,
but notice this right here, this public thing with the dash C. This dash C stands for community
string and for many devices, this is how we keep our
SNMP information secure. All the security string is a password
and the reason I have public here is that by default is the community
string for many, many systems. So if it just happens
to have SNMP enabled, odds are it's going to use the public
string that you could actually try right now if you wanted to. So let's see if this by default
has the public community string. Seems like it doesn't. I'm not reaching
it, which is good. Yeah, no response. I like that for what that does tell us
now is that we have some setup to do to make sure that we create a community
string that enables us to access from any monitoring server. The CPU utilization.
Let's set that up right now. So on our server SNP already set
up running, we verified that. Now let's go edit a file. This is really
simple, takes just a few steps here. We'll do a pseudo nano, my favorite
text editor, fight me, it's the best. And we'll edit the file, Etsy snmp snmp d.com just like this we're in now SNP can be quite a complex system. Don't let all this stuff overwhelm you. We're just going to
change a few things here. I'm just going to scroll down and show
you the stuff you want to look for. Now, the first big option you'll
see is agent address. Now this agent address option will tell
it what interface to operate SNP on and where it can listen from to
make things simple for testing, we're going to remove all this bit and
just put UDP colon 1 6 1 telling it to operate UDP port 1 6 1 on all interfaces. Now let's see if we can use the public
string to access it. Boom, we got it. Now odds are you probably don't want to
use the public string because that's not very secure at all. Everyone knows about
it. Let's go change that real quick. We will change that to something
like network check. Now, as I mentioned before,
you cannot only just read, but you can write with SNP and
if we change that O to a W, so read write community string
that would give w permission to write or change information about this
device if it allows that. So for example, a host name is able to be pulled via
SNMP. Let's try to pull that real quick there. Got it. But now lemme
try to change the host name. So I'll go ahead and save this network,
check community string for a read, write, restart my service and change
it to Bernard Hackel with this command. So now when I pull the information, There's the new host name. Now, as you
might imagine, that's kind of dangerous, kind of crazy actually in the
hands of a hacker. You're done, especially since this community string. If you were to sniff the network traffic,
this would be sent in plain text, no encryption, but don't worry,
we do have more secure options. That's where S-N-N-P-V
three comes in. Now, so far we've been talking
about SNPV one and V two C. I don't know why they just didn't simply
call it V two feels like an Apple move, but V two C is the number two option.
It does offer improvement over V one, but it still has the same security
problems. V three has encryption, it does username and password restrict
views and what OS you can pull. So on a production enterprise network,
S-N-M-P-V three is the way to go. Now for time sake and simplicity, I'm
not going to demo how to set that up. We're going to stick with SNPV two. Also, a lot of devices may not support SNPV
three and I want maximum compatibility for this video. Now I am going to go back and change my
read write to read only and I will keep my community string at network check now
as we prep our stuff to be monitored. So think about all the things in your
home network are your enterprise network servers, printers, even
Macs, come default with SNMP. You'll want to go and configure the
same community string across all these devices. Or if you're going
SNPV three, you crazy animal, then of course configure the username
and password the same across the board. Now lemme show you real quick,
I have to show you this. I do have a sonology NA in my network
and you can see right here in my control panel there's a config for SNPV one, V two C and then V three and you
can specify device information, location and contact
information. Now, bonus, in addition to setting up SNMP with
what's up gold and the free addition, I'm going to show you
how to install right now. You can also set up SSH and have
WhatsApp Gold be able to log in and run commands to check for things, custom
things that maybe SMP doesn't offer. So to make that possible across the board, you want to make sure that you have the
same SSH credentials on all your devices so you can do stuff like that. Now
we've been talking a lot about Linux. We're done with that.
Let's talk about WMI. The cool thing about WMI is
that it kind of just works. It's installed on most Windows systems
by default and is running and similar to SNMP, it pulls a ton of information
about a Windows system. In fact, more so in a lot of cases, but typically your Windows firewall
will be enabled and blocking things. This is my best thing. Example of fire. It's blocking things like
ICMP traffic and WMI traffic. Let's enable just those two
things real quick via PowerShell. So we'll launch our terminal and we're
going to launch that as administrator. So we'll right click this and say run
as admin yes and commands will be below. We'll copy and paste and run this command
to allow ICMP traffic and we'll copy and paste this command to allow WMI. Now you could just turn off your
Windows firewall. Don't want to do that. Let's just allow these two things,
keep everything else locked down. Now, unlike S-N-M-P-W-M-I does require a
user account on the Windows system with administrative privileges. So here's
what you would do for the local computer. We're going to go to the search
bar and type in LUS rmg msc. I know it's weird. Just
go ahead and do it. This will launch the local users
management. Now by the way, if you're new to managing Windows in
this way and you want to learn more about that, I do have a mini course actually
free on my network, check Academy, YouTube channel. You can check
out somewhere around here. Go ahead and jump there. It's awesome.
Anyways here, if we click on users, these are all the users on your computer
right now, at least the local users. Notice right here I have
an administrator account. Probably for you it's going to be
disabled. That's good practice. But for testing right now, to have
fun, we can enable it temporarily. So you can just double click him and right
here you can see account is disabled, mine's unchecked, it's already good.
If you want to enable that account, just uncheck that and then
what you can do is right, click this and say Set password and
set your administrator password. I've already done that. Now a best
practice might be to set up a new user. Let's just call him Bob and set
his password. We'll create Bob. Then we can jump inside and then
we can double click Bob, sorry Bob, and add him to the administrator's group
and that should give him the correct permissions. I mean every permission, you could be more fine tuned with this
if you want to research that and make that happen. For now, for simplicity,
that's what we're going to do. So now here we are at the WG installer
IT Monitoring system Install. wg probably has the easiest install of
anything I've ever done for IT monitoring systems, but let's talk
about where you put it. What are your system
requirements? Officially, for like an enterprise environment, you'd want to put this on a Windows
server machine, but you don't have to. You can actually put this on a
Windows 11 desktop unofficially. But I have tested it and it does work,
and this can be a virtual machine, so no worries. That's what I'm
going to show you how to do this on. You will want a quad core processor, at least eight gigs of ram and of
course like everything in the IT world, the more you have, the happier you'll
be and at least 25 gigs of storage. That's all you need. Let's get going
right now. Now I got a link below, so go ahead and look in that description, click the link and you're well on your
way to downloading the WhatsApp gold free edition. Just put your information in, make up your company name and
we'll download our free edition. After we select all the chimneys
that come it, once it's done, we'll go ahead and launch
it. Now, just so you know, as you're installing this free edition, what's included is that you can
monitor up to 10 devices. No, I love that because you're not
limited by what's called a sensor. A lot of other monitoring
softwares out there. We'll give you something free
and so here's a bunch of sensors, but each sensor is monitoring
a ping and then CPU. That's one sensor and then
hard drive. That's one sensor. So you could use all your sensors on
one device, but with what's the goal? You can monitor everything
you want about 10 devices. I like that model a lot better.
Anyways, keep on going. Oh, and also if you are a business and
you're installing this to try it out, you can just simply upgrade to a paid
version instantly without reinstalling. It's just a license key. It's pretty
sick. Anyways, let's keep on going. Standard installation coffee break
except everything blindly. Yes, I agree. Now I was going to check and
make sure it can install for me. I have a pending reboot
because I installed software
updates. Fine, I'll reboot. I'll see you guys in a second. All right, now we need a license
key. Let's go check our email. Pretty sure that's where it went. Ah,
found mine. It was in my email. Cool. Paste that in. We'll click on
next. It's going to verify. Now, what this will also do is install
SQL Server Express as your database. You can change the default password
or leave it default for now. In fact, you can leave pretty much everything
default, including everything for IIS, which is Windows web server. It
will be binding to port 4, 4 3. We'll have it generate a self sign
certificate for now and it's ready to install. Let's go ahead and proceed. Now, this may take a minute and perfect time
to go and make sure that all your stuff has community strings. SSH login, and
this can include thing like Ubiquiti, unify, network hardware. Just go
ahead and check all your stuff. Also, one more note, and I don't
want you to miss this, monitoring things like GPU and other
custom stuff like that doesn't come out of the box with things like WMI and SNMP.
For that we'll do a custom SSH monitor, which is just so cool, but I'll
show you that here in a bit. Okay, that took forever. So the MP M or
the network performance monitor, do you want to allow it to
access public networks? Yes, we want it to be able to scan everything
and it's going to be really cool how it discovers everything. It's so fun. Allow
yes, do all the things please. Okay, it's finally done, and what it may do first is launch a
web browser navigating you to this page. We'll set our admin password
set and we're done. It's set up. Now here comes the fun part and the good
news is you've already done a lot of the work in setting up your servers
and all the devices with your community strings for SNMP and then WMI having
that administrator user account set up. So let's do a scan. We'll go to discover, click on new scan and this is how
we're going to add all our devices. It's just going to go out and look for
them on our network. So here we go. You can see by default it's already
going to include my local subnet and gateway. Cool. I don't have to add
that. I'm going to add a few others. I do have a lot of stuff going on
and that's all I'll add for now. Click on next. It will
at this current time, if you're connecting to a WN controller
or a VMware virtualization environment or a storage device, it'll automatically discover things
about those like VMs and such. I'm not going to enable that right
for myself. We'll click on next. Now we're at credentials.
Notice by default, it does have the public
SNPV one community string. We want to add ours we just created,
so I'll click on the plus sign here. I'll do S-N-M-P-V two, call this network, check my SNMP string and I'll
put the read string in there. Notice you could put the right community
string in there too. Click on safe. And then I also want to add my
WMI. Now, as you can see here, we do have a bunch of other options for
adding credentials, like a lot like AWS. That's cool. We'll talk
about that here in a moment. Let's go ahead and add windows.
See my windows for domain username, if you're using a local account, you'll simply put in period slash to
say it's local and I'll do administrator because that's the account I set up and
then my password and click on Safe and then we'll do one more. I want to do my SSH account
specifying username and password, and if using Cisco devices or anything
that might have an enable password, you would add that there
as well and save. Okay, so now we can say use these
three credentials when
you're trying to scan all the devices in my network, click next, I'm
going to have it enable auto discovery. Click on next. We could set this as a schedule to run
every day to discover new devices that come on our network. I don't
want to do that right now. Just click on next and I'll run the
scan right now. Ready, set, discover, and you'll notice this is going to
take a minute because it's scanning everything you have. Now,
if you just have one subnet, it may not take very long,
but I'm going to let that run. I've got a morning meeting,
I'll be right back. Okay, my scan finally finished it, found a ton
of stuff. Now just a couple of things. Notice that for a lot of my devices,
it automatically found the names. It also automatically determined what
the device role would be and what type of things it should monitor, and that's what the roles kind of play
in this role thing. They decide like, okay, if it's a router, we're going
to monitor router type things. If it's a Windows device, we're
going to use WMI to monitor, monitor windows stuff. And then you see over here we
have monitoring capabilities. This little symbols for our router
here, we have log and application stuff, but at this current time, all these devices are simply
discovered and I'm not monitoring them, so keep in mind you are with a free
edition, limited to 10 devices, but you can still have these sitting
here discovered. Now we can do this. Watch this. I'm going to
click on this button here, display map to see what's
going on, what's up gold. We'll actually automatically create
maps for you based on your network. For example, here, Hogwarts would be my router and
it's connecting a few things here. It's pretty sick. Now let's actually monitor some stuff
so we can get back to our device view by clicking on device list right here, and I'm going to monitor a few key
devices like NC Win two and the room of requirement. That's one of my Windows
editing machines here at the studio. I'll monitor my Pop OS system,
which is my new AI server, and I'll monitor Hogwarts.
I'll do my nas, Docker server, chasm server, my Proxim host. We'll do that and I'll click on start
or update monitoring and click on start. You'll see here that while we're
still in the discover area, these should turn green here in a
moment. Green circles, I believe. Boom. There they are. Awesome. So now
instead of being in the discover tab, I can actually go over here to my
network tab and actually see my monitored devices. Now let's have
some fun. Real quick. I'm going to click on the
room of requirements and
then I'll click on the heart up here, the status and
go look at everything. So first I can already see
under the monitoring tab, I'm seeing some response time and
that it is up the ping monitor. Let's click on disc and CPU Memory.
It found my core I nine processor. It found my memory, my how much 64 gigs and how much
utilization is happening right now, and of course my discs and one
volume is getting pretty full. Again, keeping in mind, this is
all coming in through WMI. If this were a router or switch, I
could see information here on that tab, but it's not. Let's go
find one real quick. Let's go back to my network
and I'll jump into Hogwarts, which is my unified system, and
actually I got two things that are down. This is my dream machine.
Oh, it's like a DNS filter. So it actually went out and found all
interfaces and maybe not all of them. I want to monitor, but let's jump in
there real quick. We'll go to status. Let's go to router switch
interface, and this is pretty cool. I mean we've got all the
interfaces right here. We can see utilization details on each
interface and whether or not they're up. Let's pick a Linux device like
Pop Os here. I'll pop in there. That's my AI server. Actually,
one interface is down. Lemme check on the live activity
and see what's going on. Oh, got some critical availability here.
Anyways, let's jump in there real quick. Go to status. I can see my rise
nine in there. Memory usage, 125 gigs. This is crazy. I actually
missed one of my micro tick switches. Lemme try to add that manually real
quick. So I'll go to discover a new scan. I can do a quick one off IP scan and
let's watch it come up real quick. Okay, there it is. Let's go ahead and start monitoring that
bad boy and we can see everything about this. I love it. Now the goal for this video is not to
teach you everything about what's up gold. That would take way too long. There's a lot of bells and whistles
that you can just go crazy with and play with and I hope you do. But let me just show you a few
things while I got you here. First thing right here, I got a map for
my Discover Networks and actually look, nasty Switch came in brand new and we're
getting connections to him. Very neat. We also have dashboards. So if I
click on the analyze tab up here, go to dashboards, let's go to
my home dashboard. First of all, this is just cool, right? You get a
nice little fuzzy feeling like, Hey, I've got 10 things up, or maybe you
got things down, quick view of that. Then you can go to top 10 and it'll
show you top 10 on CP utilization, disc utilization, memory ping
interface, and this is done over time. It's always watching, so you don't
have to, so if you're like, man, my network was super slow at this time,
why? You can jump in here and look. Or Hey, my kids are complaining about the
computer running slow or it's not working. Maybe their disc utilization super high, their memory is super high consistently. You can jump in here and see that stuff.
I'm even stuff like this interface, errors on your switch on your
router, that'll help you identify. Maybe you have a faulty ethernet
cable or an interface misconfigured. I mean it's so powerful what you
can do with this kind of stuff. Action and alerts. Now, if we were
connected to a proper wireless controller, it would show us our
wireless environment as well, but I'm using a unified dream machine
and it doesn't really do that here. We get critical activity, just summary of what's going on and
storage monitoring. This is really cool. It's connected to my sonology
NA and telling me everything. Got to know I'm pretty much
out of space. Look at that, which is why I switched to
my new na. It's 55 terabytes. By getting back to Analyze, you can jump in and just drill into a
bunch of different views and dashboards, performance type things, network device
specific things, alerts and actions. You can actually do a knock viewer, which will kind of go through like a
slide deck or a PowerPoint presentation going through different
dashboards you can create. So imagine putting this on a spare monitor
or a screen in your knock or wherever you want to put this. I'm going
to set this up in my server room. You actually see this as a monitor
behind my glass currently working on that right now. Now, while I can't show you
everything about wg, what's up gold, I'm going to show you a few things that
I think demonstrate the power of what this thing is and how crazy it is that
you can use this for your home network. So we're going to do two
things real quick. First, I'm going to show you how we can monitor
absolutely everything. It's insane, and I'm first going to start out with
showing you how to monitor something that doesn't come default.
Actually, two things. First, I want to monitor an A CTP port 3000, which is what I have my open web UI server
running on, which runs my chat, GPT, my local chat, GPT from my daughters. I want to make sure that the server's
running all the time, and if it goes down, I want to know about it. Then I will
show you how to monitor the GPUs. Now, this is my AI server,
Terry, so I wish it would, I need to change the name Terry. Terry
has two GPUs inside of him. By default, it's not going to be monitored. I'll show you how to do that with a
custom SSH command that we can use to monitor something. It's so cool.
Limitless, I'm telling you, limitless. Let's try it out. So first I need to create an HTTP
monitor to monitor Port 3000. For that, we're going to go to settings,
libraries, and monitors. This is a library of all the
monitors we have here. Under Active, we have an HTTP monitor by
default, that's going to be 80. I'm going to copy that and say AI server, open web ui, and I'm going to monitor TCP
3000 and save. Cool monitor, create it. Now I'm going to go over
here to Terry Pop Os. Right click, click on edit device
monitors and active monitors. I'm going to add one. I'll select one
here. First option from right there, AI server, open, web ui, and click.
Okay. Let's jump in and Terry real quick, go to a status and see if that
monitor comes up. Cool. There it is. AI server, open web ui, and
it does show up. Perfect. Now, before I show you how to set up
a custom SSH monitor with GPU, let's set up an alert like what
happens when this AI server goes down? How do I know about it? I'm not going to just be looking
at the screen all the time. Let's set up an alert real quick to
set that up. We'll go to settings, actions and alerts as you might
imagine, and then actions and policies. I hear what's pretty cool is you
actually have sounds that will play. If you have it open on your computer,
it might scare you. It scared me. I'm not going to lie. So what we'll do first is click on
the plus sign here to add an action. Look at the options. We have so many. You got a beeper who has a beeper a pager, at least they have some modern things
like create a ServiceNow incident post to I-F-T-T-T, slack, a
PowerShell script, SSH. These are all in reaction to something
happening with anything to do with your network. CPU utilization
spikes over 75%. Boom, alert. You can do pretty much anything and it's
awesome. For me. I want to do Slack. I'll select that. I'll name it AI server alert, and then we'll do our web hook URL. And I'm not going to demo how
to set up a Slack webhook, URL. If you're a Slack user, you can figure
that out. I'm just demoing that. You can set up an alert. I'll
paste my web hook URL in here, which will post to a certain
Slack channel. I'll save that. And then we'll go define
an action policy up here. The action policy will define
what does a downstate mean, and then what do we do when
something's down? I'll show you. We'll click on the plus sign right
here and we'll name it my policy, just something generic, and
then we'll add a policy rule. Click on the plus sign. Our action will be our AI server alert
that we just created the Slack alert. And then based on what state, down,
down at least two minutes, 20 minutes, five minutes, what does down mean? I'm
just going to say down immediately. If the first instance that
you sense that it's down, tell me that I can have a blackout
schedule saying on the weekends, I don't care if it's down. That's never
the case though. You'll get that call, click on, okay, and then I'll add
another policy saying same action, but if it's up, go ahead and let me
know it's up. So if I get a down alert, I'm like, oh, crap, it's down. But if
it comes right back up, I'm like, okay, I don't have to worry about it
right this second. That's my policy. I'm going to save it. So now
to add my policy to the server, I'll right click it, go to edit device. Now I could just add it to the entire
server to all of Terry by going to Action Policy and selecting the policy, but I only want to alert
on one thing right now. So I'm going to right click Terry,
click on edit device, click on monitors, and then active monitor properties.
I'll select the monitor I added to 'em, which was AI server, open web ui,
and under here under action policy. I'll set that, just that one
monitor to my action policy. So now I'll stop that web service taking
down open web ui, and once it pulls, it should tell me that the service is
down via Slack and boom, there it is. I got the alert. Now there are a few
other things down, but specifically ai, web server open web UI is down.
Now if I bring it back up, I should get the same thing. And
then there's my up message saying, AI server open. Web UI is one of
the interfaces that are up. Now, of course these messages
are totally customizable, but I just want to show you the
power that you have right now. And speaking of power, let's monitor my
GPUs now, right now my Pop OS server, good old Terry, you know I'm
going to change his name. I'm getting tired of calling him
Pops, I'm change it real quick, Terry. Now, right now, Terry,
we know nothing about his GPUs. Let's add a custom monitor. So I'm going
to give back to my network over here, jump into Terry and go to his properties. I'm going to add a performance monitor. So I'll click on the plus sign
under performance monitor, and I'll create one just for this device.
I'll do an SSH performance monitor, say GPU one temp, and I'll run
this command. This command, if you just ran it on the command line
would give you the temperature Here I can have the monitor, log into
the device, run this command, and then pull back the information. I'll save this and here in a moment
we should see something come in. And if I jump back into Terry here,
there's GPU one temp, last poll, 34 degrees. You can actually
see it here on the oh, too much on the network tab. If you
select Terry and performance monitors, you can see the GPU temp
being polled right there. Let's add the other one or on the command
on GPU one. That's the location there, and that should poll and pull that up
here in a moment. Yep, there it is. My two GPUs, one and two, and I can do
that off utilization or whatever I want. Really what you can access via the
command line is kind of limitless. So you can monitor anything, literally
anything, and that's so cool, and they can do PowerShell stuff too. So now you've got no excuse not to
monitor everything in your network. And now WhatsApp Gold did graciously
give me a license to upgrade and do some crazier stuff. Let me show you what
my WhatsApp gold looks like right now. I'm still getting set up, but
I've got a ton of devices, but I also have access to
their configuration management, which will back up any configs I have
on my Cisco switches, Juniper switches, that's powerful. It
also has log management, so I can send my Windows
device logs to here. Also, any sys log from my Linux
servers, BAM, right here, really anything that can produce sys log, which is everything can go here and
I can digest it, it's delicious. I also have network traffic analysis,
which we'll look at NetFlow, and if you don't know
what NetFlow is, man, it just tells you everything you want to
know about the flows of traffic in your network. So you can send
that to here. What's up Gold? It'll tell you everything. And if
you're rocking VMware or HyperV, they got virtual monitoring. Hopefully
they add Prox Mock soon, please. But seriously, again, just such a powerful program for a
home lab or this is a dream for an enterprise. Dude, I use this in the
enterprise and it works so well. Again, I still have nightmares about receiving
the WG emails and like, oh my gosh, it's down. It was so
good. So that's the video. I hope you learned a thing or two about
monitoring. That's a whole industry. It's a whole thing because yeah, it's
important to know how to set up stuff. It's servers and the hardware and
infrastructure, knowing how to manage it. But what is there? What's it doing?
Is it breaking? Is it overheating? Or whatever it is. So knowing how to set up monitoring
and what is used in monitoring. S-N-M-P-W-M-I it.
Monitoring systems alerts. If you're in IT or you plan to be in it, you're going to use something like
this. So if you're watching this, you're like, I don't know, I
don't have a home lab. Create one, start monitoring it right now.
You're using a computer right now. You can monitor that computer
and start to build your home lab. And this actually could be a
resume building opportunity. You can put on your resume that you
run your own IT monitoring system. What's up gold? It's been
around for a long time. Create a blog about it or something. Put it on your resume that's going to
make you look good. That's the video. Thank you to Progress and WhatsApp goal
for sponsoring this video and making this thing free. It's so
sick. If you do set this up, I want to know about this. So put it in the comments below or
tag me on hex Twitter, whatever it is, or jump into our Discord
community. Check the link below. We'd love to have you come by. Say hi, hang out for a little bit and
talk about it. Monitoring systems.