Advanced Git Tutorial - Interactive Rebase, Cherry-Picking, Reflog, Submodules and more

Video Statistics and Information

Video

Captions Word Cloud

Reddit Comments

Captions

This is an advanced git course taught by Tobias Gunther. He'll teach you some more advanced concepts and tools to make you more productive and confident with Git. Hello Free Code Camp friends. My name is Tobias. And I'm going to make you a better get user today. If you're using Git is your version control system, you have a lot of powerful stuff available. And we're going to talk about some of the more advanced topics today, cherry picking, interactive rebase sub modules, and much more. At the end of the session, you'll be a lot closer to becoming an advanced git user. Before we start a huge shout out to the Free Code Camp team, thank you so much for being on this mission of teaching people how to code and thanks for letting me contribute a little bit. A couple of words about my own background, I am part of the team behind tower tower is a get desktop GUI for Mac and Windows, we've been around for more than 10 years and helped over 100,000 developers and designer, designers work more easily with get become more productive with Git and make fewer mistakes. For today's sessions session, you do not need to have tower installed, you can follow along on the command line, don't worry. Okay, let's get started. Interactive rebase is like the Swiss Army knife of good commands, lots of use cases and lots of possibilities. It's really a great addition to any developers tool chain. I'll start to explain what you can do with it. And then we'll look at a couple of practical examples. So in short, interactive rebase allows you to manipulate your commit history, you can make changes to your commits after the fact. So you could change an old commits message, you can delete commits, reorder them, combine multiple commits, and even reopen a commit, edit it and make new changes from its change that crazy stuff. Keep in mind, though, that interactive rebase rewrites your commit history. So the commits you manipulate will have new hash IDs. Technically, they are new commits. And there's a simple rule, you shouldn't use interactive rebase on stuff that you've already pushed to a shared repository. Use it for cleaning up your local commit history. A good example is when you're done developing on a feature branch, before you merge it back into a team branch, you can optimize that clean up the commit structure, so it's easier to understand. And that's exactly what interactive rebase is meant to do. Before we take interactive rebase for a test drive, let's look at the general workflow. This is the same no matter what exactly we're doing deleting a commit changing a commit message combining commits, the general steps are always the same. So the first step is to determine which range of commits you want to manipulate. How far back in time do you want to go? Once you have that, you can start the actual interactive rebase session. And finally, you'll have the change chance to make your edits. You can manipulate the selected commits by reordering deleting, combining and so on. Okay, let's try this in practice. We'll do two things as examples. First, we'll change an old commits message. And secondly, we'll combine two old commits into one. Alright, let's hop into the command line and take a look at our repository. git log. Okay, so let's say we want to change this commits message here, right. Just to note, as a reminder, if it were the most recent commit, we want to change this one here, we could just use git commit amend to change the commit message. But for anything older than that, for this one, this one, this one, we really need to use interactive rebase. So the first step for interactive rebase, determine our base commit termen, how far back in history we need to go. And this is at least the parent commit of the one we want to manipulate. So we said we want to change this here. So at least this one here. I could simply copy the commit hash of that commit, or I can do a little bit of counting. So this is head currently head minus one head minus two head minus three, I'll use that. So Git rebase. Stash interactive had till the three. So there is an editor window popping up just as I promised. Let's make this bigger And you can see all of the commits that we just selected. And by selected, I mean that we referred to a range of commits, had to leave three, three behind the head, from head all the way to the one we mentioned. So we can now manipulate these commits. But big but, but we don't change the commit message right here. In this window, we only tell git, what kind of manipulation we want to perform. So we just mark up this line here. With the reward action keyword, that's the action keyword that allows us to change the commit message. And don't worry, all of the action keywords or protocols are documented here in the comments. Alright, reword, then we just save and close the window, we do not make our changes to the commit message right here, right? We do not do that we only mark it up. So save and close. And that's actually it. So now we get an editor window. Again, why is that because we can now finally actually change the commit message. So let's make a change optimize the markup structure, very important. Save and Close. And let's check git log again. And we can see it's now optimized the markup structure. So we've successfully changed an old commits message. So let's do one more example of an interactive rebase. Let's combine. Let's take a closer look. Let's combine well, let's say these two, these two commits here into one. Again, first step of any interactive rebase. Correct, we have to determine the base commit. And again, we have to go at least to the parent commit, so at least this one here, so it's head on is 1234. Okay, so the git rebase, interactive head, and tail D for again, here are the commits, we just requested for manipulation. And this time we're using the squash keyword. Squash works by combining the line we mark up this here with the one above. So by marking up line number two, with squash good, we'll combine it with line number one, which is what we want, right, we want to combine these two commits, so markup, this one with with squash and combine it with the one before. Again, we just save and close the window. And a new window will pop up, right. Why is that because by combining two old commits, we are creating a new one, right. And we can give this new commit a commit message, of course, so I'll just write combine two into one. And again, save and close. And let's see what happened here on the command line. All right, now we have combined these two old commits, you can see they were present here, and they are now combined as one under this commit here. So these were just two examples of what interactive rebase can do. If you want to see the other possibilities, you should check out our video about undoing mistakes in Git, and I'll add a link in the description. Normally, when you integrate commits, you do this on a branch level, you use a command like Git merge or git rebase and integrate all the new commits from one branch into your current head branch. And normally, that's exactly what you want. But in some situations, you might only want a specific commit not all the commits from another branch. And this is where it gets cherry picking tool comes in handy. I'll show you a practical example in a minute. But before that a word of warning. Don't get too excited about cherry pick. Your main way of integration should still be on the branch level, merge and rebase were built exactly for this job. Cherry Picking should only be used for very special situations, you need to have a good reason to use it. It's not a replacement for merge and rebase a sure I'll show you a great example for when cherry pick is actually the right tool to use. Let's say you made a commit on the wrong branch. As an example many teams don't want you to commit directly The domain or master or other long running branches, and still we forget and commit there by accident, of course. And this is a perfect example for cherry pick. Let's take a look at an example situation in practice. Okay, so currently we are on master and have that checked out. And the last commits is newsletter signup page. Okay, but we already have a feature newsletter branch. So my guess is, and I'm correct. This should have been here, this should have happened on the feature newsletter branch. So we want this commit here to be on the feature branch and not on master. So let's use cherry pick to move it over. Oh, okay. So first, we switch to the feature branch, feature newsletter in that case. And then we cherry pick that commit over and we can just copy the commit hash. So this is the one we want to the clipboard. And then git cherry, pick and hash of the commit. And Wallah, let's take a look at what happened. And we can see, boom, here it is. Now, it should have been here in the first place. And finally, here it is. If you also want to clean up the master branch, so this doesn't linger here, it shouldn't be here. Then you can also do this week and get checkout back to master and then get reset to Ashutosh hard head till the one head till the one means one behind the head. So we're effectively deleting this from the master branch. And Wallah. Master is clean and newsletter, feature newsletter now has that brand that commit sorry, well as if nothing had happened. Let's talk a bit about a very special git tool, the ref lock. You can think of the ref log as gits diary. It's a journal where git logs every movement of the head pointer. So what is the movement of the head pointer Exactly. That's when you commit checkout, merge rebase, cherry pick reset, all of the more interesting actions are documented there. And this makes it perfect for those situations where things go wrong. Let me show you a perfect use case for the ref log. Let's say we don't want these two newest commits here. At least we think we don't want them anymore. And to get rid of those, we'll perform a reset, right? They've disappeared from the commit history. And a while later, we noticed that this was a horrible mistake. We've just lost valuable commits. So let's have some fun and make this horrible mistake in practice. Alright, so here we go. Let's say we want to throw away these two commits, and get rid of them. So we are making this here our latest revision on the branch. I just copied the commit hash here to the clipboard. And on the command line, I can get reset dash dash hard and use this commit hash. Alright, let's have a look what happened. And we can see those newer commits disappeared. And we're back at this rubbish in here. I'll give you a couple of seconds to realize that this was a horrible mistake, panic emoji con. And there was invaluable data in those commits, and we just deleted them by accident. So let's see if the ref log can help us in this situation. Opening the ref log is as easy as typing get ref log on the command line. And this here is the ref lock that journal where git logs all of the important actions. And first of all the entries are ordered chronologically, which means the most recent, the newest items are at the top. And if you look closely, you'll find this catastrophic reset action right at the top. We just did that 20 seconds ago. So the journal seems to work. Good news. Now if we want to undo our last action, we can simply restore the state before, which is also documented here. We can just copy the commit hash, this is the state before. I'll copy that to the command line to the clipboard. And I could use git reset once more totally valid, but I'm going to create a new branch. Should I find a little bit more Elegant, happy ending. And I will have it start at that previous revision with the correct state. So let's use that. Let's see what happened. Okay, there's a new branch. Wow, that can that contains the seemingly lost commits. We just saved our necks. Awesome. So we just restored some deleted commits with the ref lock. And here's another great use case for the ref log, restoring branches. So again, we think we can clean up and delete something. This time, it's a branch not commits. But again, as life sometimes goes, we realize that it was a bad idea. And we still need it. So let's take a look at this scenario in practice. That is, ref log, re cover, Branch. Okay, so here's a beautiful, beautiful feature login branch. meds, let's say our customer or boss or team lead, say they don't want it anymore, it's not necessary. So of course, we go ahead and delete it. Just to make things worth worse, this commit here, this one is present nowhere else. So we are definitely going to lose data when we deleted. All right, but before we can delete it, we need to step away from that branch. It's currently the head branch at the moment, and we can't delete the head branch and get so we're checking our master. And then we can delete, we have to force deleted because it contains a commit that is not merged into any other branch. And Wallah. Okay. So now let's say our customer changed their mind. Again, the feature is necessary after also, what do we do? What do we do? What do we do? Let's take another look at the ref log. Alright, and it turns out, we're lucky the last entry was when we made that checkout to master. So let's try returning to the state before again. And I will copy the so here is the checkout and I'll copy this here to the clipboard and get put that on top. git branch I think it was called featured login if I'm right, and we want to have it start at that revision here that we just copied from the ref log. And let's see what happened. So okay, a branches back Wallah. And it includes that seemingly last commit that valuable commit another happy ending. If you're using Git in a desktop GUI like tower like you're seeing here, undoing a mistake is really easy, you can simply press command Z to undo your last action, just like in a text editor, when you made a typo, and this works for almost anything, accidentally deleting branches, commits files, undoing a commit or a bad merge, you might have pushed something too early, doesn't matter what it is. So I can just go ahead and delete this one more time, delete, forced delete, I can just press Command Z. And again, there it is very helpful. Pretty often in a project, you want to include libraries or other third party code. And you can go the manual way, download the code, copy the files into your project, and commit the new files into your projects. Git repository. And this is a valid report approach. But it's not the cleanest one. If you simply throw those library files into your project, you're inviting a couple of problems. First, you're mixing external code with your own unique project files. But the library is actually a project of itself. And it should be kept separate from your own work. There's no need to keep these files in the same version control context as your project. And secondly, should the library change because bugs were fixed or new features added? You'll have a hard time updating the library code. Again, we need to download the raw files and replace the old items. These are quite common problems in everyday project. So Git has a solution and that's sub modules. A sub module is just a standard git repository. The only specialty is that it's nested inside another parent repository. sub module is a fully functional and Git repository. You can modify files commit, pull, push from inside, like with any other repository. Let's see how sub modules work in practice. Alright, so we have a little sample project here. And let's say we want to add some third party code. Creating a lib or or vendor folder is a good idea so that it's cleanly separated from our own files. Let's do that. So let's create lib, and change into that folder. And I will add a little JavaScript library from GitHub, I can do that with a Git submodule, add command, and then provide the remote URL of that repository. And you can see that the command starts to clone the Git repository responsive misspecified. And let's see what happened in our Working Copy folder. And here we go. We have a lib folder, I just created that. And here it is. Our project now contains that third party library inside the lib folder, you can see there's a dot Git sub folder here. So this is indeed a fully featured git repository. And let me emphasize this, again, the actual contents of that sub module, they're not stored in our parent repository. Right? This is important to understand apparent repo only stores the sub modules remote URL, the local path inside the main project and the checkout revision. Of course, the sub modules, working files are here in our project. Now, after all, we want to use the libraries code, but the files aren't are not part of our parent projects. Git repository, they're not part of that version control context. Okay, so let's take a closer look at what else happened. So there's a new git modules file here. So let's take a look at that. Get modules. And so you can see here is the path in our project and the remote URL documented, and as also a record of that in the Git slash config file at the bottom sub module lib to progress again. And finally, the Git also keeps a copy of each sub sub modules git repository in its internal git modules folder. So here is the Git repository. There is modules, and now we have lip to progress. So this is the the Git repository of our library, stored inside of our main git repository. These are all internal files. So please don't mess with this configuration. Because as you can see, get internal gets internal management of sub modules is quite complex. There's good modules, there is dot git config, there is dot get modules. My strong recommendation is don't mess with sub module configuration values and files manually. Do yourself a favor and always use proper git commands or get desktop GUI to manage sub modules. All right, so let's have a look at our projects status. And you can see that Git regards adding a sub module as a modification like any other, so this means we have to commit it to our main repository. Let me just do that quickly. So Git commit dash m, Add to Project to progress library. Okay, we've successfully added a sub module to our main project. Congrats. Now, let's start from the other end. Let's clone a project that already has sub modules added. And let's step out of that and get clone I will clone the Apache Airflow project for that. And let's see what happens here. So I know this project has several sub modules added and we'll take a look at those in a second. Okay. All right. Here we go the airflow project. So let's see. So I know that the product checked has some sub modules in the dot GitHub actions folder. So these are our sub modules for this project. But if we take a closer look after cloning, visual empty. Okay, so what's happening here? Why, why are those sub module folders empty? Well, you already know that a project report does not contain its sub modules files, right? The parent repository only saves the sub module configurations. So when we clone a project with a default git clone command, like I just did, and we only download the project itself with the configuration, the sub module folders, however, they stay empty. And we can repair that by now calling get sub module update. And we have to initialize these for the first time. And we want to do this recursively. Sorry, I have to step into the airflow project folder. And now you can see this action happening here. It's several cloning processes start. And And now, the sub module folders have been populated, or in other words, the sub module Git repositories have been cloned can take another look at that and see, well, yeah, these are all populated now. Right as files in here. Let me see. So using git clone, like we just did, and then an extra command is a bit unnecessary, I would say. So you can achieve the same if you use a particular option with Git clone, the recurse submodules option. This tells Git to initialize all sub modules, when the cloning is finished automatically. So I would have used the command like this instead, right, we left that option here out the recurse submodules. And we would include that now, and everything works from the beginning. Alright. I don't want to go into too much detail about some modules. But one more thing is important to understand the way revisions are checked out in a sub module. A git repository can have countless committed reversion revisions, but the files from one specific revision can be in your working directory at one time, right. So in a normal git repository, you check out a branch. And automatically the last commit in that branch is your checkout revision. If you add new commits to that branch, the checkout revision is all always automatically the newest commit, right, we just move the pointer forwards to the latest commit, like like you can see here. sub module repositories, on the other hand, are always checked out on a specific commit, not a branch. And it makes sense, think about it, the contents of a branch can change over time when new commits arrive. In a sub module, however, you don't necessarily want that sub modules are mostly library code. And in that situation, I want to guarantee that I always have a specific revision of the code. Even if the maintainer of the library ships, some genius updates. I don't know if those would break my code. So I always want a specific chosen version checked out. Alright, let's quickly hop into the sub modules in this project. I can show you that. So this is a target client. And when I select the sub module here, so here you can see that this sub module is not checked out on a branch, but on a specific revision. And this is what I meant. I've already mentioned it managing sub modules is a bit complicated, I can really recommend taking a look at a good desktop, Gui, like tower, for example. So adding updating moving sub modules, all of that is really simple here, take a look and see if it's helpful. Okay, we could talk for hours about sub modules, but I think this should be a good introduction. Just try them out in your project and take it from there. And git repository is a perfect log of all your activities in a project, every change is documented. And sometimes you want to browse through this log, because you're looking for something you're wanting to find a piece of information. So let's talk a bit about how searching and finding good works. Or in other words, how you can filter your commit history. You can filter by almost anything by date by message by author by file by branch even. So let's start with date and search for all commits that happened after a specific date, for example, we're doing this in the Ruby on Rails, open source repository. So there are a couple of commits present, you can use git log and then the after flag. So now I get all of the commits that are after July 1, I specifically specified in that repository, I can also combine that with before, and thereby get all of the commits that are between July 1 And July 5. Pretty easy. And if you want to search for a certain commit message, you can also do that, you can do that with the destination grep. Flag, and let's search for anything that contains refactored in its message. And you will see here is one, here's one, and there's lots of others. You can get really fancy with GREP, because it accepts regular expressions. So actually no limits to your creativity. Looking for a certain author works exactly the same using the author flag. So git log, let's search for anything from a certain Heinemeier colleague. And of course, before I forget that, you can combine these criteria, of course, so for example, you could use dash dash author and dash dash before in combination to find all commits from a certain person before a certain date. And then you can also search for files, this can be really handy to understand how a certain file evolved over time, for example, when you know that a certain file used to work fine in the past, but it doesn't anymore. And you could filter for commits with only that file. So let's do that git log double dash, I'll explain that readme dot markdown, sorry, markdown. And I get all of the commits. Were readme dot markdown was manipulated. The double dashes in that command, if you've taken a look, there just to make sure that Git doesn't confuse the file name with a branch name, right. And finally, pretty helpful way to see commits that are not in one branch, but in another one. So very handy. If you want to want to find out, for example, what happened in the main branch, let's say, after you branch off with your private feature branch. So let's type git log, feature. Login, dash dash, or sorry, dot double dot notation, main. And this will show you all commits that are in Maine, but not in feature login. And you might then decide to do a merge so that you're up to date again, or you can at least see what happened in Maine while you were gone in your future work. Alright, so much for today. Be sure to check out my little advanced git kit. It's completely free of charge. It's a little collection of short videos about a lot of advanced get topics from things like interactive rebase, all the way to branching strategies, merge conflicts, sub modules, what have you. It's really helpful if you want to become more productive with Git and version control. And again, it's free. Alright, have fun and see you soon. Here on the Free Code Camp YouTube channel.

Info

Channel: freeCodeCamp.org

Views: 227,643

Rating: undefined out of 5

Keywords:

Id: qsTthZi23VE

Channel Id: undefined

Length: 34min 0sec (2040 seconds)

Published: Thu Nov 18 2021