Prototypical Networks

Some nice links:

  1. N-Shot Learning: Learning More with Less Data
  2. Meta-Learning: Learning to Learn Fast
  3. Papers:
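For context, the core idea in prototypical networks (Snell et al., 2017) is simple: embed the support examples, average them per class to get "prototypes", and classify a query by its nearest prototype. A minimal numpy sketch of that classification rule (raw features stand in for learned embeddings; all names here are my own):

```python
# Sketch of the prototypical-networks classification rule:
# a class prototype is the mean embedded support vector;
# queries go to the nearest prototype.
import numpy as np

def prototypes(support, labels):
    """Mean embedding per class."""
    classes = np.unique(labels)
    return classes, np.stack([support[labels == c].mean(axis=0) for c in classes])

def classify(queries, classes, protos):
    """Assign each query to the class of its nearest prototype (squared Euclidean)."""
    d = ((queries[:, None, :] - protos[None, :, :]) ** 2).sum(-1)
    return classes[d.argmin(axis=1)]

support = np.array([[0., 0.], [0., 1.], [5., 5.], [5., 6.]])
labels  = np.array([0, 0, 1, 1])
classes, protos = prototypes(support, labels)
print(classify(np.array([[0.2, 0.3], [4.8, 5.1]]), classes, protos))
```

In the real model the embedding is a learned network and training minimizes the softmax over negative distances; the sketch only shows the nearest-prototype step.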



TensorFlow – save, restore, freeze models

Deploying TF models can be quite tricky.

In fact there are multiple ways to save/load TF models, each serving a slightly different purpose or use case:

  • simple_save
  • saver
  • estimator
  • keras
  • tflite
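As a quick illustration of the keras route (a hedged sketch assuming TF 2.x; the toy model and the filename `toy_model.keras` are made up for the example):

```python
# Sketch of the "keras" option: save a whole model (architecture + weights)
# to disk and restore it. Toy model and filename are illustrative only.
import numpy as np
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(4,)),
    tf.keras.layers.Dense(1),
])
model.save("toy_model.keras")                        # serialize to disk
restored = tf.keras.models.load_model("toy_model.keras")

x = np.ones((2, 4), dtype="float32")
# the restored model should reproduce the original's predictions
assert np.allclose(model.predict(x, verbose=0), restored.predict(x, verbose=0))
```

The other options trade convenience for control: `simple_save`/`Saver` expose the raw graph-and-checkpoint machinery, while tflite targets on-device inference.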





ML Algos in R

I have been trying to understand how the ML algorithms in base R fit together with / compare to their RevoScaleR and MicrosoftML (MML) counterparts.

#  R                                      RevoScaleR  MML                   Comment
1  lm                                     rxLinMod    —                     Linear models
2  glm                                    rxGlm       —                     Generalized linear models
3  glm w/ binomial family and logit link  rxLogit     rxLogisticRegression  Logistic regression
4  rpart                                  rxDTree     —                     Decision tree implementations
5  gbm                                    rxBTrees    rxFastTrees           Boosted decision tree implementations
6  —                                      rxDForest   rxFastForest          Random forest implementations



Links:

  1. Fitting Logistic Regression Models
  2. Generalized Linear Models
  3. rxDTree(): a new type of tree algorithm for big data
  4. A first look at rxBTrees
  5. A First Look at rxDForest()


Feature Scaling in SGD

SGD is the perfect algorithm for use in online learning, except it has one major drawback: it is sensitive to feature scaling.

In some of my trials with the SGD learner in scikit-learn, I have seen terrible performance if I don’t do feature scaling.
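The effect is easy to reproduce; a minimal sketch on synthetic data (dataset and variable names are my own) comparing SGDClassifier with and without a StandardScaler:

```python
# Sketch: SGD's sensitivity to feature scaling in scikit-learn.
# One feature is blown up by 1000x; the point is the pipeline pattern,
# not the exact scores.
from sklearn.datasets import make_classification
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X[:, 0] *= 1000.0  # one wildly out-of-scale feature
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

raw = SGDClassifier(random_state=0).fit(X_tr, y_tr)
scaled = make_pipeline(StandardScaler(), SGDClassifier(random_state=0)).fit(X_tr, y_tr)

print("unscaled accuracy:", raw.score(X_te, y_te))
print("scaled accuracy:  ", scaled.score(X_te, y_te))
```

On runs like this the unscaled model's accuracy typically degrades noticeably, while the scaled pipeline behaves well.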

Which raises the question: how does VW do feature scaling? After all, VW does online learning.


It seems VW uses a normalized variant of SGD that is scale-invariant:




L1 / L2 loss functions and regularization

There was a discussion that came up the other day about L1 vs. L2, Lasso vs. Ridge, etc.

In particular,

  • What's the difference between the L1 and L2 loss functions?
  • What's the difference between L1 and L2 regularizers?
  • What's the difference between Lasso and Ridge?
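One way to see the practical difference between the two regularizers (a sketch on synthetic data, parameters chosen for illustration): the L1 penalty (Lasso) tends to drive coefficients to exactly zero, while the L2 penalty (Ridge) only shrinks them:

```python
# Sketch: Lasso (L1) vs Ridge (L2) on the same regression problem.
# Only 3 of 10 features are informative; Lasso should zero out noise features.
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso, Ridge

X, y = make_regression(n_samples=200, n_features=10, n_informative=3,
                       noise=5.0, random_state=0)

lasso = Lasso(alpha=1.0).fit(X, y)
ridge = Ridge(alpha=1.0).fit(X, y)

print("Lasso zero coefficients:", int(np.sum(lasso.coef_ == 0)))
print("Ridge zero coefficients:", int(np.sum(ridge.coef_ == 0)))
```

This sparsity is why Lasso doubles as a feature-selection tool; Ridge keeps every feature but with damped weights.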



K-means Clustering

One of my friends recently asked me about the K-means algorithm.

  • How does it work?
  • What are the typical applications, i.e. where/how is it used in industry?

In the discussion that followed, we ended up playing around with several visualizations available that do an awesome job of explaining this technique.

We also hacked around with some code from Joel Grus's book (Data Science from Scratch) to develop more intuition about the K-means algorithm.
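In that spirit, here is our own minimal from-scratch sketch of Lloyd's algorithm (not the book's exact code; the toy points are made up):

```python
# Minimal from-scratch K-means (Lloyd's algorithm):
# alternate assignment and centroid-update steps until nothing moves.
import random

def kmeans(points, k, n_iters=100, seed=0):
    rng = random.Random(seed)
    centroids = rng.sample(points, k)          # init: k distinct data points
    for _ in range(n_iters):
        # assignment step: each point joins its nearest centroid
        clusters = [[] for _ in range(k)]
        for p in points:
            i = min(range(k),
                    key=lambda i: sum((a - b) ** 2 for a, b in zip(p, centroids[i])))
            clusters[i].append(p)
        # update step: move each centroid to the mean of its cluster
        new_centroids = []
        for i, cluster in enumerate(clusters):
            if cluster:
                dim = len(cluster[0])
                new_centroids.append(tuple(
                    sum(p[d] for p in cluster) / len(cluster) for d in range(dim)))
            else:
                new_centroids.append(centroids[i])  # keep an empty cluster's centroid
        if new_centroids == centroids:              # converged
            break
        centroids = new_centroids
    return centroids

points = [(0.0, 0.0), (0.1, 0.2), (10.0, 10.0), (10.2, 9.9)]
print(sorted(kmeans(points, 2)))
```

With two well-separated blobs like these, the centroids settle on the two blob means after a couple of iterations.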




We got some very interesting insights from playing around with K-means to cluster the colors of an image:
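The experiment boils down to treating every pixel as a point in RGB space and clustering; repainting each pixel with its centroid color gives a k-color quantized version of the image. A sketch with synthetic "pixels" standing in for a real image (the color blobs are made up):

```python
# Sketch: color quantization via K-means. Each pixel is an (R, G, B) point;
# clustering and repainting with centroids reduces the image to k colors.
import numpy as np
from sklearn.cluster import KMeans

# Synthetic "image": 100 reddish and 100 bluish pixels.
rng = np.random.default_rng(0)
red  = rng.normal([200, 30, 30], 10, size=(100, 3))
blue = rng.normal([30, 30, 200], 10, size=(100, 3))
pixels = np.vstack([red, blue])

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(pixels)
quantized = km.cluster_centers_[km.labels_]   # repaint each pixel with its centroid
print(km.cluster_centers_.round())
```

For a real image you would reshape the (H, W, 3) array to (H*W, 3) before fitting, then reshape `quantized` back.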



Deep Learning Resources