Data Exploration & Machine Learning, Hands-on

Practical Walkthroughs on Machine Learning, Data Exploration and Insight Finding







Welcome to amunategui.github.io, your portal for practical data science walkthroughs in the Python and R programming languages


I attempt to break down complex machine learning ideas and algorithms into practical applications using clear steps and publicly available data sets. If you're looking for applied walkthroughs of ML and AI concepts, you've come to the right place - happy learning!



Popular/New Posts:


All Posts:
  1. What they Didn’t Teach at Data Science School, and How to Fix It to 10x Your Career

  2. GDELT - World Events at Your Finger Tips and for Free!

  3. We Can All Be Internet Moguls — How to Create and Sell Your Machine Learning Product Online and For Free

  4. How to Create Your Own Free Email Signup Form and Enjoy 100% Creative Freedom - For Static & Semi-Static Web Sites

  5. From Financial Compliance to Fraud Detection with Conditional Variational Autoencoders (CVAE) and Tensorflow

  6. How Blogging and Making YouTube Videos Landed Me the Best Job

  7. Your Git Commit Comments, and What They Reveal About You

  8. Exploring Some Pair-Trading Concepts with Python

  9. My Six Favorite Free Data Science Classes and the Giants Behind Them

  10. Hosting a Flask Application on AWS Beanstalk

  11. TensorFlow Won the Attention Battle, Who’s Next?

  12. GPUs on Google Cloud - the Fast Way & the Slow Way

  13. Executive Time Management — Don’t Suffocate the Creative Process

  14. Pairing Reinforcement Learning and Machine Learning, an Enhanced Emergency Response Scenario

  15. Find Your Next Programming Language By Measuring “The Knowledge Gap” on StackOverflow.com

  16. My #1 Piece of Advice for Aspiring Data Scientists

  17. Chatbot Conversations From Customer Service Transcripts

  18. Serverless Hosting On Microsoft Azure - A Simple Flask Example

  19. Google Video Intelligence, TensorFlow And Inception V3 - Recognizing Not-So-Famous-People

  20. Rapid Prototyping on Google App Engine - Build a Trip Planner with Google Maps and Yelp

  21. Yelp v3 and a Romantic Trip Across the USA, One Florist at a Time

  22. Show it to the World! Build a Free Art Portfolio Website on GitHub.io in 20 Minutes!

  23. Google Video Intelligence and Vision APIs - Automatically Recognize Actors and Download their Biographies in Real Time

  24. Life Coefficients - Modeling Life Expectancy and Prototyping it on the Web with Flask and PythonAnywhere

  25. Convolutional Neural Networks And Unconventional Data - Predicting The Stock Market Using Images

  26. The Fallacy of the Data Scientist's Venn Diagram

  27. Reinforcement Learning - A Simple Python Example and a Step Closer to AI with Assisted Q-Learning

  28. Simple Heuristics - Graphviz and Decision Trees to Quickly Find Patterns in your Data

  29. Office Automation Part 3 - Classifying Enron Emails with Google's Tensorflow Deep Neural Network Classifier

  30. Office Automation Part 2 - Using Pre-Trained Word-Embedded Vectors to Categorize the Enron Email Dataset

  31. Office Automation Part 1 - Sorting Departmental Emails with Tensorflow and Word-Embedded Vectors

  32. Easy Market Profile in Python: Grasp Price Action Quickly

  33. What-if Roadmap - Assessing Live Opportunities and their Paths to Success or Failure

  34. Where Are Your Customers Coming From And Where Are They Going - Reporting On Complex Customer Behavior In Plain English With C5.0

  35. Databricks, SparkR and Distributed Naive Bayes Modeling

  36. R and Azure ML - Your One-Stop Modeling Pipeline in The Cloud!

  37. Get Your "all-else-held-equal" Odds-Ratio Story for Non-Linear Models!

  38. Predict Stock-Market Behavior using Markov Chains and R

  39. Big Data Surveillance: Use EC2, PostgreSQL and Python to Download all Hacker News Data!

  40. The Peter Norvig Magic Spell Checker in R

  41. Actionable Insights: Getting Variable Importance at the Prediction Level in R

  42. Survival Ensembles: Survival Plus Classification for Improved Time-Based Predictions in R

  43. Anomaly Detection: Increasing Classification Accuracy with H2O's Autoencoder and R

  44. H2O & RStudio Server on Amazon Web Services (AWS), the Easy Way!

  45. Analyze Classic Works of Literature from Around the World with Project Gutenberg and R

  46. Speak Like a Doctor - Use Natural Language Processing to Predict Medical Words in R

  47. Supercharge R with Spark: Getting Apache's SparkR Up and Running on Amazon Web Services (AWS)

  48. R and Excel: Making Your Data Dumps Pretty with XLConnect

  49. Going from an Idea to a Pitch: Hosting your Python Application using Flask and Amazon Web Services (AWS)

  50. Getting PubMed Medical Text with R and Package {RISmed}

  51. Find Variable Importance for any Model - Prediction Shuffling with R

  52. Bagging / Bootstrap Aggregation with R

  53. Feature Hashing (a.k.a. The Hashing Trick) With R

  54. Yelp, httr and a Romantic Trip Across the United States, One Florist at a Time

  55. Quantifying the Spread: Measuring Strength and Direction of Predictors with the Summary Function

  56. Downloading Data from Google Trends And Analyzing It With R

  57. Using String Distance {stringdist} To Handle Large Text Factors, Cluster Them Into Supersets

  58. SMOTE - Supersampling Rare Events in R

  59. Let's Get Rich! See how {quantmod} And R Can Enrich Your Knowledge Of The Financial Markets!

  60. How To Work With Files Too Large For A Computer’s RAM? Using R To Process Large Data In Chunks

  61. Predicting Multiple Discrete Values with Multinomials, Neural Networks and the {nnet} Package

  62. Modeling 101 - Predicting Binary Outcomes with R, gbm, glmnet, and {caret}

  63. Reducing High Dimensional Data with Principle Component Analysis (PCA) and prcomp

  64. The Sparse Matrix and {glmnet}

  65. Brief Walkthrough Of The dummyVars Function From {caret}

  66. Ensemble Feature Selection On Steroids: {fscaret} Package

  67. Mapping The United States Census With {ggmap}

  68. Using Correlations To Understand Your Data

  69. Brief Guide On Running RStudio Server On Amazon Web Services