The Data Incubator Capstone Project

The project uses patent assignment data from the USPTO (https://www.uspto.gov/) to analyze what is happening in different sectors. An assignment is a transfer of patent ownership from one company to another, or from a person to a company. It typically results from a merger or acquisition, or from an employee assigning patents to their current employer.

The first stage of the capstone looks at the data numerically: for example, how many patents are assigned each year, and which major companies dominate the assignment activity in each sector.
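The kind of counting described for the first stage can be sketched with the standard library alone. This is only an illustration: the records, field layout, company names, and the helper names `assignments_per_year` and `top_assignees` are all made up here and are not taken from the notebooks or from the USPTO data.

```python
from collections import Counter

# Hypothetical records: (year, assignee name). The values are invented
# for illustration; real assignment data has many more fields.
assignments = [
    (2011, "Acme Corp"),
    (2011, "Globex Inc"),
    (2012, "Acme Corp"),
    (2012, "Acme Corp"),
    (2013, "Initech LLC"),
]


def assignments_per_year(records):
    """Count how many assignments fall in each year."""
    return Counter(year for year, _ in records)


def top_assignees(records, n=2):
    """Return the n assignees that receive the most assignments."""
    return Counter(name for _, name in records).most_common(n)


per_year = assignments_per_year(assignments)
leaders = top_assignees(assignments)
```

Here `per_year` maps each year to its assignment count, and `leaders` lists the most active assignees first.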

The second stage of the capstone extracts the titles of the assigned patents for keyword extraction, to find the research trends inside each sector (I set a cut-off at 2010 to reduce the file size). After cleaning the text data, Latent Dirichlet Allocation (LDA) is used for topic modeling to find the main topics in each sector. KMeans and Affinity Propagation are also tried, to compare the clustering methods and to find the most meaningful number of clusters. For visualization, the pyLDAvis package (https://pyldavis.readthedocs.io/en/latest/readme.html) is used to provide an interactive interface.

The interactive plots need to be downloaded as HTML files to be viewed on a local computer.

Inside the interactive plots folder there are 10 files, each representing a model run for one sector. Each file shows the top hot words that appear in that sector.

The 'other_topics' folder contains some code I tried for studying the relationship between stock price and patent assignment; no meaningful relationship has been revealed so far.

Folders and files:

.ipynb_checkpoints
interactive plots
old_code
other_topics
README.md
capstone_first_stage.ipynb
capstone_second_stage.ipynb
capstone_second_stage2-gensim.ipynb
capstone_second_stage2.ipynb

ai-zhong/patent_research
