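💶 Kafka-SparkStreamNLP is a real-time financial text analysis platform managed with Docker containers. It pulls text from a news API, uses Kafka for data stream management, runs NLP processing in Spark Streaming with a fine-tuned pretrained model, and writes the results through an output stream to ClickHouse for later OLAP analysis on a visualization platform. ⭐️⭐️⭐️⭐️⭐️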
Updated Dec 28, 2024 - Jupyter Notebook
Extrinsic and intrinsic plagiarism detection
Work focused on a Transformer model for star-rating classification (1-5) of Yelp reviews.
Academic sequence labelling comparing DistilBERT and an encoder-only Transformer
DistilBERT model for sentence segmentation.
Small NLP projects with Deep Learning techniques
Classification of text from YouTube comments using DistilBERT language models from Hugging Face Transformers