My PC keeps turning off... - Tech Support Walkthrough

Video Statistics and Information

Video
Captions Word Cloud
Reddit Comments

He's one of us? I knew I liked him

๐Ÿ‘๏ธŽ︎ 13 ๐Ÿ‘ค๏ธŽ︎ u/ShadowsWandering ๐Ÿ“…๏ธŽ︎ Jan 13 2022 ๐Ÿ—ซ︎ replies

Anthony is GOAT ๐Ÿ

๐Ÿ‘๏ธŽ︎ 7 ๐Ÿ‘ค๏ธŽ︎ u/AcidSugar1414 ๐Ÿ“…๏ธŽ︎ Jan 13 2022 ๐Ÿ—ซ︎ replies

It's really nice that he provided this video. It could have been shortened to only looking at task manager, event viewer, and the bios, but a full and thorough explanation was appreciated.

๐Ÿ‘๏ธŽ︎ 5 ๐Ÿ‘ค๏ธŽ︎ u/JustinBrower ๐Ÿ“…๏ธŽ︎ Jan 14 2022 ๐Ÿ—ซ︎ replies

Thatโ€™s cool didnโ€™t know he was one of us

๐Ÿ‘๏ธŽ︎ 3 ๐Ÿ‘ค๏ธŽ︎ u/masako619 ๐Ÿ“…๏ธŽ︎ Jan 14 2022 ๐Ÿ—ซ︎ replies

TIL.

One of us! One of us!

๐Ÿ‘๏ธŽ︎ 3 ๐Ÿ‘ค๏ธŽ︎ u/LockhartTx2002 ๐Ÿ“…๏ธŽ︎ Jan 14 2022 ๐Ÿ—ซ︎ replies
Captions
- There it is. I have a problem. I just updated the bios on my work station and ever since I did it's been (fingers snaps) randomly shutting off, just while I'm not doing anything. And while normally that's the sort of thing that I would troubleshoot myself, I have another problem and that, is that if I don't leave to start my vacation in the next three minutes my wife is going to absolutely murder me. So, we're gonna call in the cavalry here. Dialing A for Anthony. Hey, Anthony. - Uh, hello? - (laughs) Can you fix something for me? - Maybe. - Okay, cool. My computer is turning off and I gotta go. (fingers snap) - (sighs) Man, I just got back from vacation and I'm already doing tech support. Well, hey, why not come along for the ride and I'll show you my method for troubleshooting barely any information to go off of. Right after this information from a sponsor. - Thanks to Hulkman for sponsoring this video, Hulkman's Alpha 85S helps you instantly jumpstart your dead car battery with no aid required. Even it up to minus 40 degrees Fahrenheit, learn more at lmg.gg/hulkman. (soft music) (upbeat music) - Linus tells me that he updated the bios in order to upgrade to Windows 11 and the computer has been shutting off randomly, ever since. Doesn't matter what he's doing, even absolutely nothing. Let's see if we can track down the issue. Now, the first rule of thumb when doing any kind of troubleshooting, always check the most basic solutions first. So, we'll start with the wall power and mm-hmm, actually, he's got a ups down here and I don't think anything else has been turning off randomly like the TV still got power from what he told me. So that's probably not going to be our problem. That leaves us to the cables, which well, I'm gonna have a hard time getting in under there. And you're not gonna be able to see that either because not only is it kinda dark, but the TV is also covering it all up, but it looks like the 810, 12 volt connector looks like it's okay, actually, there's multiple there. And they both look like they're down all the way. And the 24 pin ATX connector is also pretty solid there. So I don't think that's our problem either. Now the system has a Threadripper CPU, and sometimes over torquing, the retention mechanism can flatten out the sockets pins so the CPU is not making contact, which can make for a no power on, but that doesn't match the description of the issue where it's happening, even when the system is idle. And besides that, I don't think I'm going to be able to open this thing up without some help. And I'd really rather not do that. For now, we'll fire it up and move on maybe we'll hit lucky. First, we'll open up a system monitoring tool like HWiNFO and see what our temperatures and voltages look like. And we'll also load up task manager to see if any processes are running wild and check to make sure that there's nothing weird showing up in the start-up items. CPU usage looks okay to me right now. We're at 3% with only a few things actually, being used right now, you've got a LVS running and I guess Linux is doing some idle mining, which is fine as long as you are not specifically using your GPU for that. As for the temperatures, they look okay, they're a little high, but at the same time, we've got a GPU doing idle mining on a closed loop like this. It's not really a closed loop, but like it's sharing the cooling between the CPU and GPU. And the CPU is currently idling around in the 40s, 50s, or 60s, depending on what's happening. Normally depending on the CPU, the enclosure and the cooling you have on your system, it'll usually hover around 30 or 40 somewhere in there. As it is right now, I'm not concerned about the thermals at all, that seems perfectly fine to me. Yeah. That's not even all that hot. The hotspot temperature is in the 60s. So, this cooling set up is definitely enough. We'll have to find something else that might be the cause of the problem. We'll keep these tools open for now so we can keep monitoring it as we use the computer. That way, if anything does happen, we can kinda be tipped off about it before it inevitably shuts down. Before we move on, we should look at the voltages. Now, something to bear in mind about software reported CPU voltages is that they're not always correct. They can change more quickly than the update on screen. So take them with some salt. It's also worth remembering that CPU core voltage, will go higher or lower depending on the core clocks and what the CPU is doing. Thanks to Load-Line Calibration, a feature that tries to keep the CPU from under or overshooting its voltage range and crashing or potentially taking damage over time. This throws a lot of people for a loop, because even if you set the voltage manually in the bios, it's not guaranteed you're gonna get that voltage under any circumstances. Now this is an Overclock Threadripper 3000. So, I can't really expect any specific voltage range, but I think usually if it's in a low power state, you can go from 0.6 all the way up to around 1.4 or more volts, depending on the load. And while nothing stands out to me right now, this is one of the potential culprits for a random blue screen or power offs. So it's worth keeping an eye on. Next, time to see what Windows recorded last time the PC unexpectedly powered off using Event Viewer. This is built into Windows, just right, click on the start button and go up to Event Viewer here. Or you can access it by typing this on the command prompt if you're ever really stuck. This particular version how you Event Viewer has been around since Windows 2000. It is basically unchanged, even in Windows 11. We'll head over to Windows logs and then system. Now, Buddy fault, this is sorted by date and time. So we'll scroll down to the last time we know it happened and see what Window sees. Now, I actually already see some things here. So that's interesting, but you can actually tell when Windows was shut down, big gap in time, followed by three event log items when the service is restarted. And if there was a blue screen, you'll see some bug check items labeled as critical as well. You can use these items to see the blue screens error code, which you can then use to narrow things down via Google. This PC isn't blue screening so we don't have any bug checks. Well, let us see what else is happening at the time it went down. And it looks like we have a bunch of ACPI events here. These are related to power management and that's interesting. Remember what I said about voltages earlier. We might have just found some evidence for it. And by the way, you'll notice that there are a lot of items in the event log just in general. So how do I know that what I'm looking at is what I'm supposed to look at? Well, informational stuff like these things here are usually not that useful unless you're trying to track down something very specific. And if you've been fixing Windows PCs for long enough, you'll know that some of these distributed calm things are usually just there on a fresh install and they're entirely benign. If you do want to know what exactly these are referring to though, you can copy the string of letters and numbers and search it online or if you're feeling adventurous, in the registry. This particular one is for immersive shell, not exactly exciting or relevant to what we're doing, but, knowing how to do this can be useful for tracking down misbehaving, drivers and services. And just knowing about it will help you if a scammer ever tries to convince you, Windows is broken by showing you these errors. It's a pretty common trap. Coming back to our main issue. We've got a candidate for the problem, but we should also check the application log to see if anything weird is going on there. This is where things like app crashes go. And while these don't usually coincide with Windows issues, if something specific keeps happening at the same time as a recurring problem, it could be related. No, it doesn't look like there's any patterns showing up here. So we can rule this out for now. The security logs aren't usually relevant on single user PCs, but they can show you when you've logged in, when you've used stored credentials and so on. Useful if you suspect you've been hacked or if there's a problem with Windows log on, but it's not what we're after. Set up on the other hand are related to Windows updates and a failed or broken Windows update can sometimes do what we're seeing all the... Usually with a reboot instead of a shutdown as expected. As expected most of this is unremarkable information about which updates have been installed when? But a broken update or Windows update service will show errors here. For now our best lead is the power management situation. Buddy fault, modern CPU's will enter a low power state when they're not in use called C states. Not to be confused with S states like sleeper or hibernation. These are like the performance oriented P states. You might already be familiar with in the form of turbo boost, where the CPU will ramp up its performance in response to load. C states are just in the opposite direction. Then depending on the state, the system might turn off some parts of the CPU or pack inactive cores to significantly reduce power consumption and the voltage supply to the cores and therefore the CPU. You can see where I'm going with this, right? But this is a desktop PC so idle power consumption isn't a major concern. Therefore, let's try and turning off, C states. To do that we will restart into the bios and to do that, first I need to stop all of these logging programs and restart the system. Oh, I probably shouldn't have done that while Linux was still like doing stuff, but oh, well (keyboard key clicking) this is how you get into the bios usually. (keyboard keys clicking) Sometimes you can get into it with F2 and you can get into it through Windows as well if you hold Shift while clicking on restart, then using the advanced options function to select UEFI firmware settings, but we didn't do that. And so I have to match delete. (keyboard keys clicking) From here we're gonna look for C states. Now some bioses like ACEs, lets you use the search function, which we will use. But sometimes you'll find it in like CPU configuration under advanced or sometimes you'll find it in like a power management configuration. For now we'll just hit F9 to do search and do C state. (keyboard keys clicking) And this is enabled, sometimes you'll see two of them, one for global C state control and one for C1E or Enhanced Halt. If we had that here, we would disable it as well. For now, let's go ahead and disable C states. It's a... F10 doesn't work from here, okay. So F10 to save and reset that's pretty normal and okay. If F10 doesn't work on your bios for save and reset, you can just go to the exit screen and it'll ask you if you wanna save, usually. Now that we've rebooted, how can we know that this worked? We can't, we can't really test for a random power off event, so we'll just need to wait. I'll let it go for a few days and then we'll come back to it. Now thankfully, (air swooshing) I've got my water bottle from lttstore.com to keep my drink cold until then. I'll just add a little bit of ice every now and then. All right, let's take a look. It's still powered on, that's a good sign. Let's see the event log. Pixel refresher, I do not care about you. I wanna see Windows, I don't see any of those ACPI events. That's a good sign. That's really good. I think we're good. This was happening multiple times per day before, and it's been a few days now and it's still chugging along. I think it's safe to call this one fixed. As for why the problem started happening in the first place. Well, power management has always been and always will be a little bit of black magic. A little tweak to how power management works in a bios update, silicon degradation, or even just the silicon lottery can affect how stable power management features will be for a given system. Speaking of which get subscribed because we finally have our last guide you'll ever need for how to build a PC coming up and if you liked this video, you'll love that one. It's the longest video we've ever done, except for the, Yule Log. (fire rattling) Back to power management though. It's usually fine for laptops because of how tightly integrated the CPU and the logic board are. But not so much for desktops with modular components and longer traces. Many power management features are in fact disabled by default on a desktop, particularly anything to do with PCI Express. In my experience, it's almost never a good idea to enable these on a desktop, unless you're doing it specifically to see if it does work for your hardware. C states are usually okay, but there's a reason you'll see many threads about them on forums going back many, many years . In this case, whatever it was, it seems like it was just dumb luck, but it's fixed now. And hopefully, you'll come away from this video, not only knowing that C states can be flaky, but also a little more about how to dig in and find out what a problem might actually be. And Linus will come back from vacation and I'll still have a job. And hey, you'll also know a little bit more about our sponsor Pulseway. Thanks to Pulseway for sponsoring today's video. Pulseway, let's use centrally manage all your desktops, servers, network devices, and cloud infrastructure in one place. You'll be the first person to know when a user has an issue or when there's a problem with your it environment. You'll have out of the box commands to take actions such as killing processes, resetting user passwords, running PowerShell commands, backing up files and even remote control. With powerful auto remediation tools, Pulseway can automatically resolve critical IFD failures like low disk space, high CPU, and even restart your services. The patching engine will prevent vulnerabilities by checking for updates for both your operating system and third-party applications and run those updates for you on a schedule that you define. And the best part is that you can do all of that from the mobile app or from the desktop. Try it for free today at pulseway.com or through our link below. Thanks for watching guys. This one's a bit different. So why not check out Linus's new rack? It's also a bit different and a whole lot of fun.
Info
Channel: Linus Tech Tips
Views: 2,217,070
Rating: undefined out of 5
Keywords: pc, computer, workstation, power off, shut off, shut down, random, problem, troubleshooting, repair, walkthrough, technician, windows, bios, threadripper, power, acpi, power saving, c-states, event viewer, process, temperatures, voltages, fix, power management
Id: 1EYLzD2GytA
Channel Id: undefined
Length: 14min 3sec (843 seconds)
Published: Thu Jan 13 2022
Related Videos
Note
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.