Hey there! I'm Shay. I live, code and consult on data science in Tel Aviv, Israel. Among my current and past clients are such companies as Voyantis, BigPanda, Forter, Intezer, Bringg, Bank Hapoalim and quite a few others. I also serve as VP Data Science @ LeO, and I've headed the data science outfit at a couple of startups before striking out on my own. ๐
I have co-founded and manage DataHack, a non-profit / community / yearly hackathon dedicated to promoting data science and machine learning in Israel, which also runs projects such as DataTalks, DataNights and Kaggle-IL. I have also founded NLPH, a community meant to encourage and enable joint work on open NLP tools for Hebrew.
Project | Description | Stars | Downloads | Forks | Issues | PRs |
pdpipe | Easy pipelines for ๐ผ DataFrames | |||||
pulearn | Positive-unlabeled learning w/ ๐ | |||||
skift | sklearn wrappers for ๐ fastText | |||||
cachier | Persistent memoization ๐ decorators | |||||
birch | Hierarchical config for ๐ packages | |||||
awesome-twitter-data | Twitter datasets & resources |
- ๐ ย Data Science Project Flow for Startups
A data scientistโs take on our process. - ๐ ย Peer Reviewing Data Science Projects
Making your work more error-proof using peer scrutiny. - ๐ ย Document Embedding Techniques
A review of notable literature on the topic. - ๐ ย Inferring causality in time series data
A concise review of the major approaches. - ๐ง ย Understanding the scaling of Lยฒ regularization in the context of neural networks
A theoretical delve into the Lยฒ equation. - ๐ ย Stationarity in time series analysis
A review of the concept and types of stationarity. - ๐ ย Detecting stationarity in time series data
A theoretical and practical guide. - ๐งช ย Extensive Python testing on Travis CI
Testing open-source Python on several operating systems. - ๐ฆ ย Publishing your own Python package
A practical guide to packaging Python code.