Introduction to Python for Data Science
Who’s Karlijn?
● Data Science Journalist @DataCamp
● Master’s degrees in Information Management, Literature & Linguistics
● Worked as a junior big data developer with Scala, Hadoop & Spark
● Love for literature, languages, data science & big data
● … I also love to talk, so please stop me whenever you have any questions!
@willems_karlijn
DataCamp - Learning By Doing
✓ Interactive learning for R, Python & Data Science
✓ Over 730,000 learners, 70% working professionals
✓ +1000 business groups
Data Science - Useful Links
● Data Science Industry Infographic
● Battle of the Data Science Venn Diagrams
● …
Useful Links
● Khan Academy
● Python Statistics For Data Science Resources
● Statistical Thinking in Python (Part 1)
● Scipy Tutorial: Linear Algebra
● Algorithms with Stanford University
● Machine Learning with Stanford University (Andrew Ng)
● Tom Mitchell, “Machine Learning”
Useful Links
● Fundamentals of Computer Science (EdX course)
● End-to-end Development Process (Article)
● Intro to Python for Data Science Course (DataCamp)
Resources
Useful Links
● Introduction to Databases with Stanford University
● Principles of Database Management (YouTube Series)
● Alejandro Vaisman & Esteban Zimányi, “Data Warehouse Systems”
● SQL with DataCamp (soon!)
● Introduction to Databases in Python
Useful Links
● Martin Fowler, “Continuous Integration”
Useful Links
● Introduction to Apache Spark (and follow-up courses on EdX)
● Hadoop: The Definitive Guide
● Martin Odersky, “Programming in Scala”
Useful Links
● Kaggle
● DrivenData
● Meetup
● Data.World
● GitHub Tutorial
You don’t need to start from scratch!
Useful Links
● Galvanize
● Quantbros
● General Assembly
● Your network!
● …
Useful Links
● DataTau
● Reddit - /r/datascience, /r/python, …
● KD Nuggets
● DataCamp Community
● FiveThirtyEight
● Stack Overflow
● …
People To Follow
● DJ Patil, Chief Data Scientist with the White House.
● Gregory Piatetsky, KDnuggets President, #Analytics, #BigData, #DataMining, #DataScience expert, KDD & SIGKDD co-founder, was Chief Scientist at 2 startups, part-time philosopher.
● Ben Lorica, Chief Data Scientist @OReillyMedia, Program Director of @strataconf & @OReillyAI. He is the host of the O’Reilly Data Show podcast.
● Andrew Ng, Chief Scientist of Baidu; Chairman and Co-Founder of Coursera; Stanford CS faculty.
● As a top Big Data influencer, Kirk Borne, the Principal Data Scientist at @BoozAllen, Ph.D. Astrophysicist, ♡ Data Science, is definitely worth following!
Thanks!
Questions?