Data Exploration & Machine Learning, Hands-on


"

Recommended free walkthrough, check it out and boost your career:


Boost your data science career

"







Welcome to amunategui.github.io, your portal for practical data science walkthroughs in the Python and R programming languages


I attempt to break down complex machine learning ideas and algorithms into practical applications using clear steps and publicly available data sets. If you're looking for applied walkthroughs of ML and AI concepts, you've come to the right place - happy learning!



Popular/New Posts:


All Posts:
  1. Let's Build a Web App to Design Titanic Passengers - Walk Through of Chapter 3 from My Book Monetizing Machine Learning

  2. One Line of Code to Send Messages to a Discord Server

  3. Grow Your Web Brand, Visibility & Traffic Organically - 5 Years of amunategui.github.io

  4. The Python and Flask Rest API, Abstracting Functions for Web Applications and SaaS

  5. What they Didn’t Teach at Data Science School, and How to Fix It to 10x Your Career

  6. GDELT - World Events at Your Finger Tips and for Free!

  7. We Can All Be Internet Moguls — How to Create and Sell Your Machine Learning Product Online and For Free

  8. How to Create Your Own Free Email Signup Form and Enjoy 100% Creative Freedom - For Static & Semi-Static Web Sites

  9. From Financial Compliance to Fraud Detection with Conditional Variational Autoencoders (CVAE) and Tensorflow

  10. How Blogging and Making YouTube Videos Landed Me the Best Job

  11. Your Git Commit Comments, and What They Reveal About You

  12. Exploring Some Pair-Trading Concepts with Python

  13. My Six Favorite Free Data Science Classes and the Giants Behind Them

  14. Hosting a Flask Application on AWS Beanstalk

  15. TensorFlow Won the Attention Battle, Who’s Next?

  16. GPUs on Google Cloud - the Fast Way & the Slow Way

  17. Executive Time Management — Don’t Suffocate the Creative Process

  18. Pairing Reinforcement Learning and Machine Learning, an Enhanced Emergency Response Scenario

  19. Find Your Next Programming Language By Measuring “The Knowledge Gap” on StackOverflow.com

  20. My #1 Piece of Advice for Aspiring Data Scientists

  21. Chatbot Conversations From Customer Service Transcripts

  22. Serverless Hosting On Microsoft Azure - A Simple Flask Example

  23. Google Video Intelligence, TensorFlow And Inception V3 - Recognizing Not-So-Famous-People

  24. Rapid Prototyping on Google App Engine - Build a Trip Planner with Google Maps and Yelp

  25. Yelp v3 and a Romantic Trip Across the USA, One Florist at a Time

  26. Show it to the World! Build a Free Art Portfolio Website on GitHub.io in 20 Minutes!

  27. Google Video Intelligence and Vision APIs - Automatically Recognize Actors and Download their Biographies in Real Time

  28. Life Coefficients - Modeling Life Expectancy and Prototyping it on the Web with Flask and PythonAnywhere

  29. Convolutional Neural Networks And Unconventional Data - Predicting The Stock Market Using Images

  30. The Fallacy of the Data Scientist's Venn Diagram

  31. Reinforcement Learning - A Simple Python Example and a Step Closer to AI with Assisted Q-Learning

  32. Simple Heuristics - Graphviz and Decision Trees to Quickly Find Patterns in your Data

  33. Office Automation Part 3 - Classifying Enron Emails with Google's Tensorflow Deep Neural Network Classifier

  34. Office Automation Part 2 - Using Pre-Trained Word-Embedded Vectors to Categorize the Enron Email Dataset

  35. Office Automation Part 1 - Sorting Departmental Emails with Tensorflow and Word-Embedded Vectors

  36. Easy Market Profile in Python: Grasp Price Action Quickly

  37. What-if Roadmap - Assessing Live Opportunities and their Paths to Success or Failure

  38. Where Are Your Customers Coming From And Where Are They Going - Reporting On Complex Customer Behavior In Plain English With C5.0

  39. Databricks, SparkR and Distributed Naive Bayes Modeling

  40. R and Azure ML - Your One-Stop Modeling Pipeline in The Cloud!

  41. Get Your "all-else-held-equal" Odds-Ratio Story for Non-Linear Models!

  42. Predict Stock-Market Behavior using Markov Chains and R

  43. Big Data Surveillance: Use EC2, PostgreSQL and Python to Download all Hacker News Data!

  44. The Peter Norvig Magic Spell Checker in R

  45. Actionable Insights: Getting Variable Importance at the Prediction Level in R

  46. Survival Ensembles: Survival Plus Classification for Improved Time-Based Predictions in R

  47. Anomaly Detection: Increasing Classification Accuracy with H2O's Autoencoder and R

  48. H2O & RStudio Server on Amazon Web Services (AWS), the Easy Way!

  49. Analyze Classic Works of Literature from Around the World with Project Gutenberg and R

  50. Speak Like a Doctor - Use Natural Language Processing to Predict Medical Words in R

  51. Supercharge R with Spark: Getting Apache's SparkR Up and Running on Amazon Web Services (AWS)

  52. R and Excel: Making Your Data Dumps Pretty with XLConnect

  53. Going from an Idea to a Pitch: Hosting your Python Application using Flask and Amazon Web Services (AWS)

  54. Getting PubMed Medical Text with R and Package {RISmed}

  55. Find Variable Importance for any Model - Prediction Shuffling with R

  56. Bagging / Bootstrap Aggregation with R

  57. Feature Hashing (a.k.a. The Hashing Trick) With R

  58. Yelp, httr and a Romantic Trip Across the United States, One Florist at a Time

  59. Quantifying the Spread: Measuring Strength and Direction of Predictors with the Summary Function

  60. Downloading Data from Google Trends And Analyzing It With R

  61. Using String Distance {stringdist} To Handle Large Text Factors, Cluster Them Into Supersets

  62. SMOTE - Supersampling Rare Events in R

  63. Let's Get Rich! See how {quantmod} And R Can Enrich Your Knowledge Of The Financial Markets!

  64. How To Work With Files Too Large For A Computer’s RAM? Using R To Process Large Data In Chunks

  65. Predicting Multiple Discrete Values with Multinomials, Neural Networks and the {nnet} Package

  66. Modeling 101 - Predicting Binary Outcomes with R, gbm, glmnet, and {caret}

  67. Reducing High Dimensional Data with Principle Component Analysis (PCA) and prcomp

  68. The Sparse Matrix and {glmnet}

  69. Brief Walkthrough Of The dummyVars Function From {caret}

  70. Ensemble Feature Selection On Steroids: {fscaret} Package

  71. Mapping The United States Census With {ggmap}

  72. Using Correlations To Understand Your Data

  73. Brief Guide On Running RStudio Server On Amazon Web Services