class: center, middle, inverse, title-slide # All the ways to break into data science and analytics ## SQL Saturday #759 ### Taras Kaduk ### 2018/05/05 --- # What's the crowd? -- ### - Some are in tech, but not in data -- ### - Some are students or recent graduates -- ### - Some are neither -- ### - Those few already in the data field. *What are you doing here?* --- class: center, middle # Let the data science venn diagram madness begin! --- ## The Original. Mkay. Data Science != Data Scientist <img src="https://static1.squarespace.com/static/5150aec6e4b0e340ec52710a/t/51525c33e4b0b3e0d10f77ab/1364352052403/Data_Science_VD.png" height="450"> http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram --- ## Hmmm... Interesting puzzle <img src="https://whatsthebigdata.files.wordpress.com/2016/07/datascientist_diagram.png" height="450"> https://whatsthebigdata.com/2016/07/08/the-new-data-scientist-venn-diagram/ --- ## Is this a Jumanji map? ![](https://lh5.ggpht.com/-2KvMzMGADug/T9hXUiLufMI/AAAAAAAAGhQ/0Uwr9-he2Dg/image.png) http://www.oralytics.com/2012/06/data-science-is-multidisciplinary.html --- ## Looks like my son's pre-school art, but worse <img src="https://s-media-cache-ak0.pinimg.com/originals/6b/49/d1/6b49d162e80ec2c99acd0717914666a1.jpg" height="450"> --- ## Of course, the unicorns! <img src="https://1.bp.blogspot.com/-ju4m6PBOrgo/V-E5qz99SaI/AAAAAAAAMF0/gle0zsZz_nIBEMVg0EdZHoGJhjlnBzv1gCLcB/s1600/moz-screenshot-3-729576.png" height="500"> --- ## Speaking of unicorns... ![](https://i.imgur.com/vmS0KlW.jpg) https://imgur.com/gallery/T4Wy0 --- class: center, middle # Please stop <img src="https://media0.giphy.com/media/27EhcDHnlkw1O/giphy.gif" height="500"> --- # Common language <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">[...] lots more jobs do data science than are called data scientists</p>— Hadley Wickham (@hadleywickham) <a href="https://twitter.com/hadleywickham/status/913374306062802945?ref_src=twsrc%5Etfw">September 28, 2017</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> -- <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">Personally I think the difference is programming - if you’re programming an analysis you’re doing data science</p>— Hadley Wickham (@hadleywickham) <a href="https://twitter.com/hadleywickham/status/913378018139262977?ref_src=twsrc%5Etfw">September 28, 2017</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Data science career checklist <br><br> ## - Get a PhD ## - Wait for job offers to pour in --- # Data science career checklist <br><br> ## - ~~Get a PhD~~ ## - ~~Wait for job offers to pour in~~ --- <img src="https://pix-media.priceonomics-media.com/blog/1310/chart4.jpeg" height="500"> ~ How Diverse is Data Science? https://priceonomics.com/how-diverse-is-data-science/ --- <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">it is very intimidating when all (most) of the openings for datascience jobs has phd as a requirement</p>— ദാമു(ക്കുട്ടൻ) (@damukkuttan) <a href="https://twitter.com/damukkuttan/status/913341143017861121?ref_src=twsrc%5Etfw">September 28, 2017</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> -- <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">Agreed - although I think there's still room for people to enter data science (via analytics) by showing they have the skills necessary. <a href="https://t.co/sJcEAy1by7">https://t.co/sJcEAy1by7</a></p>— Jesse Maegan (@kierisi) <a href="https://twitter.com/kierisi/status/913342532582625281?ref_src=twsrc%5Etfw">September 28, 2017</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> -- <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">Also lots more jobs do data science than are called data scientists</p>— Hadley Wickham (@hadleywickham) <a href="https://twitter.com/hadleywickham/status/913374306062802945?ref_src=twsrc%5Etfw">September 28, 2017</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # ...Yet some folks are chill though... <blockquote class="twitter-tweet" data-lang="en"><p lang="en" dir="ltr">Bewildered by how many software engineers applied for the data scientist role who have no experience or training in analyzing data.</p>— Mikhail Popov (@bearloga) <a href="https://twitter.com/bearloga/status/958492053108764672?ref_src=twsrc%5Etfw">January 31, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Roll the tape <p><a href="https://www.rstudio.com/resources/videos/fireside-chat-with-rstudio-r-in-industry-discussion/?wvideo=jfuapunqzv"><img src="https://embedwistia-a.akamaihd.net/deliveries/d19b8213a5788a881f1f6d680bd8004fc532724e.jpg?image_play_button_size=2x&image_crop_resized=960x540&image_play_button=1&image_play_button_color=4287c7e0" width="400" height="225" style="width: 400px; height: 225px;"></a></p><p><a href="https://www.rstudio.com/resources/videos/fireside-chat-with-rstudio-r-in-industry-discussion/?wvideo=jfuapunqzv">Fireside Chat with RStudio – R in industry discussion – RStudio (32:46)</a></p> > If you do not have a PhD in Statistics and you want to be a data scientist, you need to give me the evidence that you can do data science. > ~ Eduardo Ariño de la Rubia [What do you look for when hiring an entry-level data scientist? Would a master’s in Data Science or a bootcamp be beneficial? ~Quora](https://www.quora.com/What-do-you-look-for-when-hiring-an-entry-level-data-scientist-Would-a-master%E2%80%99s-in-Data-Science-or-a-bootcamp-be-beneficial) --- # Unconditional advice -- <br><br> ### - Start a blog (maintain a portfolio) ### - Build, use, and give back to your network ### - Keep learning --- # Start a blog -- #### - Practice analyzing data and communicating about it #### - Create a portfolio of your work and skills #### - Get feedback and evaluation http://varianceexplained.org/r/start-blog/ -- > My promise is this: **if you’re early in your career as a data scientist and you start a data-related blog, tweet me a link at @drob and I’ll tweet about your first post** (in fact, the offer’s good for each of your first three posts) *~ David Robinson* --- # Build, use, and give back to your network -- #### - Volunteer at user groups, meetups, and yes, SQL Saturdays #### - Ask to be a speaker #### - Learn from the best and immediately share that knowledge back #### - Volunteer your skills #### - Contribute to open source -- <blockquote class="twitter-tweet" data-lang="en"><p lang="en" dir="ltr">want to build data science experience? reach out to a local non-profit you're interested in, and ask them if you can volunteer with data collection, cleaning, and basic analysis and reporting. you get experience, the NPO gets a product they desperately need, and everyone wins.</p>— Jesse Maegan (@kierisi) <a href="https://twitter.com/kierisi/status/979772042525528064?ref_src=twsrc%5Etfw">March 30, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- #This is not a pyramid scheme <img src="https://i.pinimg.com/originals/3f/b9/3b/3fb93b191a71d5e5700121801764ea65.jpg" height=500 class="center"> --- # Keep learning -- <br><br> ### - MOOCs, online classes and learning platforms ### - Traditional education ### - Immersion --- # Find your niche in the data science food chain -- - Teach what you've learned - through blogging - through speaking - through message boards and Stack Overflow - Be an aggregator and a connector - Blog on a specific topic (e.g. politics, basketball, music) - Contribute to open-source (even if it's correcting typos) - Be a computer scientist for data scientists. --- # Some more specific advice -- - If you feel you're lacking some formal education - consider going back to school --- ### What happens if the explanatory and response variables are sorted independently before regression? > Suppose we have data set (Xi,Yi) with n points. We want to perform a linear regression, but first we sort the Xi values and the Yi values independently of each other, forming data set (Xi,Yj). Is there any meaningful interpretation of the regression on the new data set? Does this have a name? > I imagine this is a silly question so I apologize, I'm not formally trained in statistics. In my mind this completely destroys our data and the regression is meaningless. But my manager says he gets "better regressions most of the time" when he does this (here "better" means more predictive). I have a feeling he is deceiving himself. https://stats.stackexchange.com/questions/185507/what-happens-if-the-explanatory-and-response-variables-are-sorted-independently --- # Some more specific advice - If you feel you're lacking some formal education - consider going back to school -- - If you're a recent graduate, consider internships --- # From the author *"If it's not a **hell yeah** - it's a **no**"*... <br><br> > When you’re earlier in your career I think the best strategy is you just say yes to everything, every piddly little gig, you just never know what are the lottery tickets. **~ Derek Sivers** --- # Some more specific advice - If you feel you're lacking some formal education - consider going back to school - If you're a recent graduate, consider internships -- - If you're a working professional, see if you can pivot within your organization. - business analyst - mentor / sponsor - cross-training - improve your work with data --- # Example <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">The last three jobs I've held have been interested in my combination of data / statistics and communication -- as judged by blogging and presentations 🙂</p>— Caitlin Hudon👩🏼 💻 (@beeonaposy) <a href="https://twitter.com/beeonaposy/status/991058135728181250?ref_src=twsrc%5Etfw">April 30, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Example <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">And I got an internship at the new york times due to my website/github presence. No formal journalism education.</p>— Nick Strayer (@NicholasStrayer) <a href="https://twitter.com/NicholasStrayer/status/991068233305415680?ref_src=twsrc%5Etfw">April 30, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Example <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">I have no formal education in data science/analytics, it's all been down to GitHub and website.</p>— John Coene (@jdatap) <a href="https://twitter.com/jdatap/status/991074226521169925?ref_src=twsrc%5Etfw">April 30, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Example <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">I landed this job with my publication in Ecological Economics + bitbucket repo for the python module, 2 shiny Apps I built and the ability to successfully whiteboard in an interview.</p>— Samantha Sifleet (@SamanthaSifleet) <a href="https://twitter.com/SamanthaSifleet/status/991023890724937728?ref_src=twsrc%5Etfw">April 30, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Example <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">I did it: combo of MOOCs, other independent learning, and turning the bits/examples of work I’d done into something formal enough to pass an interview. Happy to give more details if you want to DM.</p>— Hamed (@HamedBH) <a href="https://twitter.com/HamedBH/status/991025064370896896?ref_src=twsrc%5Etfw">April 30, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # Example <blockquote class="twitter-tweet" data-conversation="none" data-lang="en"><p lang="en" dir="ltr">i got my first data science job through blogging on my company’s internal social platform which led to tech talks and eventually the job.</p>— jeff benzos (@jeff\__benzos) <a href="https://twitter.com/jeff__benzos/status/991180361706041344?ref_src=twsrc%5Etfw">May 1, 2018</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script> --- # The end ## Follow me: - [https://taraskaduk.com](https://taraskaduk.com) - [https://www.linkedin.com/in/taraskaduk/](https://www.linkedin.com/in/taraskaduk/) - [https://github.com/taraskaduk/](https://github.com/taraskaduk/) - [https://twitter.com/taraskaduk](https://twitter.com/taraskaduk) ## This presentation is here ### https://taraskaduk.github.io/sqlsat.html