How I Would Learn Data Science in 2023? (If I could start over)

Video Statistics and Information

Captions
and don't know where to start? Well, this video is perfect for you, because I am going to talk about how I would learn data science in 2023 if I were to start all over again. I did a version of this video in 2022, and I like to refresh it every year with my updated thoughts based on the industry trends in the data science job family. So in this video I'm going to summarize six steps that I would take to learn data science. We will talk about what research you need to do, then jump into what skill set you need to learn, followed by what you need to do after you're done with all of that. As we go through the steps, I suggest you take notes. I'm going to link a Notion template below, which you can use when researching different data science roles and job families, and for creating your study plan.

The number one thing that I would do to become a data scientist in 2023 is research the role. When I started to learn data science, I heard that data science is the sexiest job of the 21st century, thanks to the Harvard Business Review, and I jumped in. I wish I had done the research and tried to understand the data science job family a little bit more, to understand what the job family encompasses and what the different options in it are, before I jumped into one. In 2023 there are many roles within the data science job family, and I'm going to list a few of them: data engineer, machine learning engineer, data scientist, product data scientist, data analyst, just to name a few. So if you are just starting out and don't know anything about this job family, I would suggest that you read up on each of these roles and try to understand what they do before jumping into any of them, because this step will give you a really good idea of what the expectations of the role are and what you will enjoy doing. So let's say you have decided that you want to
become a data scientist generalist. You can think of it like a full-stack engineer who does back end, front end, the middle, and everything in between: a full-stack data scientist does pretty much everything. They know statistics and machine learning, they know coding, and they can take a business problem and apply data science to solve it and add value to the business.

So let's say you figured out that the data scientist role is what you want to do. You're not done researching yet. What I would suggest you do is go read the job descriptions for the data scientist role at the different companies that you are interested in. As I mentioned in some of my previous videos, data scientist is a slightly difficult job family to be in, because the definition of a data scientist varies from company to company. This is why you need to pay a lot of attention to this step: your study plan, your roadmap, is going to look different depending on which companies and which specific roles you're trying to target.

Let's say Amazon is the company that you want to target and you want to become a data scientist there. First, you go to the job descriptions for the open data scientist roles at Amazon and try to understand what skill sets are required. Second, you go to LinkedIn, look at people who currently work or have worked as a data scientist at Amazon, and look at those people's educational background (what did they study?) and then the type of projects that they've worked on as data scientists. In doing this research we're basically already doing data science, or statistics: we are collecting a sample of people and trying to understand what their educational history is and what type of projects they do. This is also a good time to look at whether there are specific certificates that they have
or specific degree programs or boot camps that these people took that helped them. This will get you a lot of information.

So let's say you've done your research and you're ready for the third step. The third thing I would do is learn the fundamentals of data science. You probably were not expecting to hear that, because a lot of the other advice you have heard tells you to start coding. I strongly believe that in order to be a good data scientist you need solid theoretical knowledge of statistics and machine learning before you get to the coding part, and there's a reason why: coding languages are a way to apply data science; they are not data science itself. You can ask any data scientist who is currently working in the industry or going to school what data science is, and they will tell you it is pretty much statistics, domain knowledge, and machine learning knowledge. That's why it's important to build those fundamentals before you start getting into coding.

One thing I would say here: if you hate coding, then I would not pursue data science, because data science does require coding. And if you have never coded in your life, I would suggest you try out coding first before you jump into the theory. But theory is so important, and it builds the foundation for data science, so I would suggest you learn statistics and the machine learning fundamentals before you jump into coding. In this step we're not going to go too deep; we're going to stay at a high level. You're going to try to understand what statistics actually is and what machine learning actually is. So before you jump into the next part, if somebody comes and asks what the difference is between linear and logistic
regression, you should be able to explain it. That's the level of knowledge that I suggest you have before you jump into the coding part. When you apply the knowledge in the coding part, you will actually learn statistics and machine learning much more deeply, but for the initial learning period I would suggest sticking to the fundamentals: statistics, mathematics, and machine learning. These are the three areas that I would suggest building your fundamentals on.

Here is how I would approach it. For math, I would get very comfortable with linear algebra. For statistics, I would get pretty comfortable with probability distributions, hypothesis testing, and Bayesian versus frequentist approaches; those are the basics in statistics. Third, I would suggest you go into machine learning and try to understand the types of machine learning (supervised, unsupervised, and reinforcement) and then, within each, try to understand what regression is, what logistic regression is, what classification is, and what a decision tree is. We're not going too deep, but we're understanding it well enough that we can explain theoretically what they are, because we're going to come back to it again.

Now a word from our sponsor, Simplilearn. If you're trying to start your career in data science and are looking for a structured program, then Simplilearn's data science boot camp might be a good fit for you. The program is developed in partnership with Caltech, which is ranked number nine in the US and, in my opinion, gives it credibility. It's a six-month, cohort-based boot camp with 25-plus hands-on projects. Going through the curriculum, I really like that it starts with building a foundation in statistics and machine learning and then jumps into coding and teaches Python, SQL, and R, with hands-on experience in three different domains. I also really like that it focuses on interview prep, which is much needed because
we all know data science interviews are not easy. In this program you will learn from global data science faculty who have a combined 40 years of experience. The cohort is starting soon and has limited seats. I'm linking the boot camp below; check it out, it might be a good fit for you. Now back to the video.

So let's say you have built your theory and have a solid foundation in statistics and machine learning. Now it's time to go to the next step, which is learning to code. There are many languages that you can learn for data science; for simplicity's sake, I would suggest starting with SQL and Python. I personally started with SQL and R. R has a very steep learning curve and is not as intuitive as Python, but R has a lot of ready-to-go statistical methods available that you can use for a very quick analysis. However, R cannot be productionized as easily, whereas Python can live in a production environment. I would suggest you start with Python and SQL. Python is super intuitive, and in the industry I have seen Python taking off because it's easy to understand for everybody involved in a project, including software engineers and data scientists.

For SQL, you can go with any learning resource; it's pretty straightforward. You just need to understand how to join different data sets (inner, right, left, and self joins, which are a weak point for me), how to write subqueries, and how to use window functions.

For Python, there are a lot of courses out there. Here I would like you to pay special attention and focus on learning Python that is specific to data analysis. For example, there are several libraries focused on data science, data analysis, and machine learning, such as pandas, NumPy, scikit-learn, and Matplotlib. Just be mindful when you're picking your learning resource for Python: make sure that it has a focus on data science
because there's a ton of learning material out there aimed at software engineers and non-software engineers alike. I would suggest you focus on Python that is more targeted toward data science.

At this point you have a really good understanding of the fundamentals of data science (statistics, machine learning, and math) and you have coding knowledge in SQL and Python. Now it's time to apply the skills that you have learned so far and turn them into projects. It's possible that while you were learning Python and SQL through whatever learning resource you used, you already did some projects, so you have some hands-on experience, but this step is specifically for building your portfolio and getting more hands-on knowledge of how to do things.

Start building your project portfolio, and remember the second step, where we looked at different people on LinkedIn who work at your target company in that role. If you have already written down what type of projects they do in their role as data scientists, this will give you a very good idea of what type of projects to target and what focus areas you need to have in your project portfolio.

If you're looking for data, there are a lot of free resources available that have tons of data and problem sets you can use to build your project portfolio. To list a few: Kaggle, Google Dataset Search, and the US Census Bureau. And you don't have to be limited to the data that is available on these platforms; you can make your own data. For example, you can look at the purchase history on your credit card, download that data, turn it into a data science problem, and do a project on it, identifying your purchase trends. For example, how likely are you to buy a coffee if it's raining? What I'm trying to say here is
that you can look at different data sets, make up your own problem, and try to build projects around it. Build at least five projects. Get your data from Kaggle or Google Dataset Search, or make your own data. Target a domain that you are interested in, but also a domain that you want to get into. For example, if you want to be a data scientist in e-commerce, then you would pick a data set related to that and solve a problem related to e-commerce. I would also suggest, if you have the option, building an online project portfolio: link it on your LinkedIn, link it on your resume, and build a GitHub portfolio. This is optional. I personally didn't do a GitHub portfolio, but if I were to do it again and didn't have any experience, I would build one so recruiters can look at it and have additional information on my skill set that they probably don't have for other candidates without a GitHub portfolio.

The reason you're trying to become a data scientist is to get a job. If you are doing it just because you're curious, that's great, but most people who are trying to become data scientists want to get a job. So the sixth step that I would recommend is to prepare for interviews, and the reason I say this is that a lot of people discount how much work interviewing is. Knowing the skill is different from doing it in an interview setting, where you are under pressure and have to answer within a time constraint. I would start practicing on a platform like LeetCode or StrataScratch (I'm going to link them below); that is for SQL and Python. Then, for your fundamentals and theoretical knowledge, I would start mock interviewing and practicing with a friend: have them ask you questions so that you are ready for the interview itself. I've created a detailed video on how I prepare for interviews and what my process is. You can go and watch it here; I'm going to
link it somewhere here. It goes into a lot more detail than I'm going into in this video. Hopefully by the end of this process you are able to go into an interview and perform well enough to get a job offer. That being said, I do want to mention that over the last few years, given that there's so much interest in the job family, it has become more and more competitive, so don't be discouraged if you don't get a job on the first try.

So this is the process that I would use if I were to learn data science all over again. Did any of these steps resonate with you or surprise you? Let me know in the comments. With that, thank you so much for watching this video, and I will see you in a different one. Have a beautiful day. Bye!
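
The purchase-history project idea from the portfolio step (how likely are you to buy a coffee if it's raining?) can be sketched in a few lines of plain Python. This is only an illustrative sketch, not something shown in the video: the records below are made up, and the name `coffee_rate` is hypothetical. A real project would load your own exported card statement and weather data, most likely with pandas.

```python
# Made-up example records: (date, was_raining, bought_coffee).
# In a real project these would come from your own purchase history.
purchases = [
    ("2023-01-02", True, True),
    ("2023-01-03", True, True),
    ("2023-01-04", True, False),
    ("2023-01-05", False, True),
    ("2023-01-06", False, False),
    ("2023-01-07", False, False),
    ("2023-01-08", True, True),
    ("2023-01-09", False, False),
]


def coffee_rate(records, raining):
    """Share of days with the given weather on which coffee was bought.

    Returns None when there are no days with that weather.
    """
    matching = [bought for _, rain, bought in records if rain == raining]
    if not matching:
        return None
    # True counts as 1 and False as 0, so sum() counts coffee days.
    return sum(matching) / len(matching)


rainy_rate = coffee_rate(purchases, raining=True)
dry_rate = coffee_rate(purchases, raining=False)

print(f"P(coffee | rain)    = {rainy_rate:.2f}")
print(f"P(coffee | no rain) = {dry_rate:.2f}")
```

With only eight made-up days the two rates say nothing reliable; on real data, this is where the hypothesis testing from the fundamentals step would come in, to check whether the difference is more than noise.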
Info
Channel: Sundas Khalid
Views: 161,320
Keywords: data science, data scientist, self-taugh data scientist, big tech, Machine Learning, Python, data science projects, data science tutorials, data science jobs, data science day in the life, AI, big data, data analyst, data analytics, business analyst, sundas khalid, sundas, people of color in tech, data science at google, data scientist 2022, data scientist 2023, how I would learn data science, data science trends 2023, simplilearn, Simplilearn, bootcamp, data engineer roadmap
Id: ZWgRvW8d_N4
Length: 12min 23sec (743 seconds)
Published: Sun Jan 01 2023