Jason Hao's Blog
0%

Zotero 是我目前使用的功能最强大的论文管理工具,其次我愿意使用 Mendeley. 但是今天主要介绍 Zotero,以及通过免费云盘 Onedrive 搭建一个用于闭环的学习环境。路资本主义羊毛,😺

Read more »

Probability and Statisitics

Machine Learning

Text Mining

Natural Language Processing

References

  1. https://github.com/graykode/nlp-roadmap#machine-learning

Learning Part

General Reading

  1. 机器之心专栏 各个组织的分享文章. SOTA models
  2. AI研习社
  3. Paperweekly 机器之心, Paperweekly Official Site
  4. Machine Deep Learning
  5. daiwk 深度学习集合 ❤️
  6. NLP-journey
  7. Aminer AI 报告,硕博论文 ❤️

综述文章

  • 预训练模型 Pre-trained Models for Natural Language Processing: A Survey. paper
  • 上下文嵌入 A Survey on Contextual Embeddings. paper
  • 文本分类 Deep Learning Based Text Classification: A Comprehensive Review. paper
  • 命名实体识别 A Survey on Deep Learning for Named Entity Recognition. paper
  • 对抗生成 A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications. paper
  • 关系抽取 More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction. paper
  • 知识图谱 A Survey on Knowledge Graphs: Representation, Acquisition and Applications. paper

Graph Neural Networks

  1. GNN note
  2. Knowledge Graph Embedding
  3. 从图到图卷积漫谈 GNN
  4. poows GNN 多个仓库收藏 ❤️, GNN 工具, 模型, 应用 ❤️
  5. awsome bert implements and paper ❤️
  6. awsome GCN implements and paper ❤️
  7. awesome-deep-gnn papers 📚
  8. ML survey 📚
  9. AI survey 📚
  10. 知识图谱 Documents 📚
  11. 《深入浅出图神经网络:GNN原理解析》配套代码 Pytorch ❤️
  12. GNN must read papers by 公孙茂团队 ❤️❤️❤️
  13. Kipf 对gcn的直觉 入门推荐 | Tony slids ❤️
  14. 唐杰老师图神经网络学习班 | 对应PPT
  15. GNN for NLP papers

Knowledge Graph

  1. 知识图谱必学课程 🎓 ❤️❤️❤️
  2. Stanford CS520: 2021 Knowledge Graphs 🎓 ❤️❤️
  3. introduce to KG by google ❤️
  4. 北京知识图谱学习小组 🎓 ❤️❤️
  5. awesome knowledge graph 整理知识图谱相关学习资料,提供系统化的知识图谱学习路径
  6. 刘焕勇 知识图谱相关项目参考, 中国科学院软件研究所 | personal blog | 知识图谱会议报告合集 ❤️❤️❤️
  7. 知识图谱最新进展论文 ❤️❤️❤️
  8. 知识图谱应用Github仓库(长期更新) ❤️
  9. Knowledge-Graph github 📚
  10. Awesome-Deeplearning-NLP-Papers 📚
  11. Knowledge Graphs papers 📚
  12. Knowledge Graph Embedding papers 📚
  13. KRL: knowledge representation learning & KE: knowledge embedding must read papers 📚
  14. Knowledge Graph Reasoning Papers 📚
  15. Knowledge Graph Tutorials and Papers & Data ❤️
  16. Awesome Knowledge Graph papers & Data ❤️ 吕海平 The University of Sheffield machine learning, brain imaging, and tensor analysis
  17. Ontology Learning as Machine Translation Expressive ontology learning as neural machine translation,2018 把OL当做MT来做 ❤️
  18. AI/RL Ontology 一个强化学习领域的Ontology by WebProtege
  19. KnOWLearn-Tool 一个ontology learning 工具,包含 term extraction, data from different domain AuthorSamuel Vieyra ❤️
  20. Programming Exercises for the Analysis of Knowledge Graphs allows interested students and researchers to perform hands-on analysis of knowledge graphs Smart Data Analytics Github from Smart Data Analytics of University of Bonn, German ❤️
  21. NLP-Knowledge-Graph 自然语言处理、知识图谱、对话系统三大技术研究与应用 📚
  22. TextGrapher - 输入一篇文档,形成对文章语义信息的图谱化展示
  23. 知识图谱 (Knowledge Graph) 专知 荟萃 ❤️❤️

Coding Part

  1. Pytorch Tricks
  2. GPT2 Pytorch
  3. NLP tutorial ❤️❤️❤️ by Tae-Hwan Jung Seoul, South Korea
  4. ALBERT-Pytorch
  5. Transformer-Pytorch

Knowledge Graph

  • ldf.fi/service/rdf-grapher RDF 可视化(在线)
  • https://jena.apache.org/ 构建语义网和关联数据(或称链接数据)的开源框架
  • https://github.com/lambdamusic/Ontospy Python library and command-line interface for inspecting and visualizing RDF models.
  • S-Match S-Match is a semantic matching framework.
  • sematch semantic similarity framework for knowledge graph
  • Knowledge-Graph-Analysis-Programming-Exercises
  • https://github.com/chrisjmccormick/simsearch (最后更新 2016)
  • https://gate.ac.uk/ GATE,开源的文本处理平台
  • http://owlready.8326.n8.nabble.com/ Owlready2,基于Python 3、面向本体OWL的开源库

Embeddings

嵌入相关论文和代码 参考

Chinese-Word-Vectors

Date Model Name Paper Codes
浅层词嵌入
2013/01 Word2Vec Efficient Estimation of Word Representations in Vector Space C
2014 Glove GloVe: Global Vectors for Word Representation C
2014/05 Doc2Vec Distributed Representations of Sentences and Documents Pytorch
Python
2016/07 fastText Enriching Word Vectors with Subword Information C++
上下文嵌入 (大多数的预训练模型可以通过transformers库进行加载)
2018 GPT Improving Language Understanding by Generative Pre-Training TF
Keras
Pytorch, TF2.0
- GPT-2(117M, 124M, 345M, 355M, 774M, 1558M) Language Models are Unsupervised Multitask Learners TF
Pytorch, TF2.0
Keras
2018/02 ELMO(AllenNLP, TF-Hub) Deep contextualized word representations Pytorch
TF
2018/10 BERT(BERT, ERNIE, KoBERT) BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding TF
Keras
Pytorch, TF2.0
MXNet
PaddlePaddle
TF
Keras
2019/01 Transformer-XL Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context TF
Pytorch
Pytorch, TF2.0
2019/05 ERNIE ERNIE: Enhanced Language Representation with Informative Entities Pytorch img
2019/07 RoBERTa RoBERTa: A Robustly Optimized BERT Pretraining Approach Pytorch
Pytorch, TF2.0
2019/09 ALBERT ALBERT: A Lite BERT for Self-supervised Learning of Language Representations TF img
2019/10 DistilBERT DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter Pytorch, TF2.0
2019/10 T5 Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer TF img
2020/05 GPT3 Language Models are Few-Shot Learners https://github.com/openai/gpt-3

References