imvladikon

Follow

Vladimir Gurevich imvladikon

Follow

62 followers · 714 following

Achievements

Achievements

Highlights

Pro

imvladikon/README.md

Hi

I'm Vladimir Gurevich, ML/NLP Engineer (IR tasks, such as Semantic Search, Information Extraction tasks, such as NER, Relation Extraction, etc.).

I am also interested in Speech Recognition and in LLMs.

Works:

jupyter-notebook-viewer - Jupyter Notebook Viewer for local files *.ipynb in browser without Jupyter Notebook installation.
wav2vec2-hebrew - package for speech recognition in Hebrew language using wav2vec2 models that were trained on Hebrew datasets (check out the datasets below).
distiller - distillation TextClassification and TokenClassification models using transformers library with different distillation methods.
spacy-trankit - spacy wrapper for Trankit (NLP pipeline for tokenization+dependency parsing+lemmatization, etc.)

Models:

t5-english-ner - NER model that based on T5 encoder that was trained on extremely small dataset.
sentence-transformers-alephbert - Sentence Transformers model that based on AlephBERT model for sentence similarity tasks.
het5_small_summarization - mt5-small based summarization model for Hebrew

Speech Recognition:

Datasets:

Contacts

Pinned Loading

jupyter-notebook-viewer jupyter-notebook-viewer Public

chrome extension for viewing Jupyter Notebooks in the browser without Jupyter Server

JavaScript 29 5
huawei-nlpcourse-project huawei-nlpcourse-project Public

Topic modeling and classification news on Hebrew with Neural Text Summarizer model

Python 1
distiller distiller Public

knowledge distillations for bert (classification, token classification models)

Python 1
wav2vec2-hebrew wav2vec2-hebrew Public

Speech Recognition for Hebrew (using wav2vec2 models)

Python 4 1

duckdb + huggingface datasets

1

#!/usr/bin/env python3

2

# -*- coding: utf-8 -*-

3

import duckdb

4

import pyarrow as pa

5

from datasets import Dataset

fuzzy_grouper.py

1

#!/usr/bin/env python3

2

# -*- coding: utf-8 -*-

3

"""

4

Simple fuzzy grouping of the list of the dictionaries using any string field and string similarities functions

5

Dependencies: