- End-to-End Memory Networks : [https://yeazzing.tistory.com/2]
- Ettention is all you need : [https://zrr.kr/CMYW]
- Improving language understanding with unsupervised learning : [https://zrr.kr/NafS]
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
- Towards Large Language Models as Copilots for Theorem Proving in Lean : [https://yeazzing.tistory.com/44]
- Time is Encoded in the Weights of Finetuned Language Models : [https://yeazzing.tistory.com/45]
- HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science : [https://zrr.kr/wNwv]