Open in app

Sign In

Write

Sign In

Xiaoou&NLP
Xiaoou&NLP

37 Followers

Home

About

Dec 1, 2021

Parallelization in Python: a beginner’s guide (1, using map)

Parallelization is an advanced function in Python that few beginners know how to use. — Parallelization is very useful in a lot of daily tasks, however tutorials often begin with obscure explanations about multithreading and multiprocessing. Don’t get me wrong, these concepts are crucial in complex scenarios, however they are intimidating and unnecessary for beginners. Let’s use a code-first and example-driven approach to introduce parallelization in Python.

NLP

2 min read

NLP

2 min read


Nov 18, 2021

Understanding the Levenshtein Distance Equation for Beginners
2.5K
19

Ethan Nam

fantastic efforts, however bear in mind that although the notion of distance was defined by…

fantastic efforts, however bear in mind that although the notion of distance was defined by Levenshtein, the algorithm was not: The minimum edit distance algorithm algorithm was named by Wagner and Fischer (1974) but independently discovered by many people (see the Historical Notes section of Chapter 8). See slp of Jurafsky

1 min read

1 min read


May 8, 2021

No, CamemBERT/FlauBERT doesn’t understand negation

Does CamemBERT/FlauBERT/Bert understand negation? Short answer: no. This notebook replicates a section of the paper of Ettinger2020 (see the reference at the end of this post) on negation using a French corpus. …

Camembert

5 min read

No, CamemBERT/FlauBERT doesn’t understand negation
No, CamemBERT/FlauBERT doesn’t understand negation
Camembert

5 min read


May 6, 2021

Playing with CamemBERT and FlauBERT

CamemBERT and FlauBERT are two pretrained language models based on the transformer architecture. Some people introduce these two models by using some frameworks like fast-bert. I’m not a fan of this approach because basically you’re just wasting you time typing some code that you don’t even understand. You end by…

Camembert

5 min read

Playing with CamemBERT and FlauBERT
Playing with CamemBERT and FlauBERT
Camembert

5 min read


Apr 21, 2021

Tmux: an introduction and most common operations

If you check my bash codebase, you’ll see that it’s possible to run a task/process in background with nohup and in the comment i said or use tumux. nohup commande # nonstop after disconnection/or use tumux So why using tumux? Well put simply, it makes your life easier. Detaching Tmux keeps all the windows and…

Tmux

1 min read

Tmux

1 min read


Published in

NLP in French/Tal en français

·Apr 14, 2021

Se connecter à un jupyter notebook à distance dans Pycharm

J’adore travailler avec Pycharm quand il s’agit des notebooks jupyter parce qu’il y a un excellent support pour les fonctionnalités courantes de l’IDE et un bon débogueur. Cependant, accéder à un notebook jupyter à distance est un casse-tête pour les débutants et la documentation est plutôt obscure, d’où ce tutoriel. …

Pycharm

1 min read

Se connecter à un jupyter notebook à distance dans Pycharm
Se connecter à un jupyter notebook à distance dans Pycharm
Pycharm

1 min read


Apr 14, 2021

Connect to remote jupyter notebook in Pycharm

I love working in Pycharm when it comes to jupyter notebooks because there is extensive support for common IDE features and a fantastic debugger. However accessing remote jupyter notebook is a headache for newbies and the documentation is rather obscure, thus this tutorial. First in your remote server, run jupyter notebook --generate-config Then use your favorite editor to edit the config file vim ~/.jupyter/jupyter_notebook_config.py

Pycharm

1 min read

Connect to remote jupyter notebook in Pycharm
Connect to remote jupyter notebook in Pycharm
Pycharm

1 min read


Apr 3, 2021

Understand objected-oriented programming (OOP) by building a minimal Web Scraping framework

Why you want to be a more organized programmer — What you are going to learn requests is a very popular package in Python because it provides many convenient methods to handle requests, parsing and exception handling. One could also use the official urllib package, however for the same tasks it is overall much easier to use requests due to its code design. …

Web Scraping

7 min read

Understand objected-oriented programming (OOP) by building a minimal Web Scraping framework in…
Understand objected-oriented programming (OOP) by building a minimal Web Scraping framework in…
Web Scraping

7 min read


Mar 27, 2021

On your way to scraping French forums

Be sure to have read the first tutorial here. Originally published at AIPrototypes.com. Get pages The construction of the scraper for the French forum doctissimo is actually simpler than lemonde website. Let's look at the link of the second page of the "pain dos" section as an example: https://forum.doctissimo.fr/sante/douleur-dos/liste_sujet-2.htm First I write…

Web Scraping

2 min read

Web Scraping

2 min read


Published in

NLP in French/Tal en français

·Mar 25, 2021

Classification de commentaires avec Camembert sans prise de tête : les fondamentaux

1. Motivation Camembert a été publié en juin 2020. Cependant force est de constater que l’emploi de Bert en français (Il s’agit plutôt de Roberta pour Camembert, voir 10 questions rapides sur Bert) n’est pas encore une tendance. …

Bert

4 min read

Classification de commentaires avec Camembert sans prise de tête : les fondamentaux
Classification de commentaires avec Camembert sans prise de tête : les fondamentaux
Bert

4 min read

Xiaoou&NLP

Xiaoou&NLP

37 Followers

Content creator at AIPrototypes.com :D

Following
  • John Au-Yeung

    John Au-Yeung

  • Kirshi Yin

    Kirshi Yin

  • Pierre Guillou

    Pierre Guillou

  • Nikos Kafritsas

    Nikos Kafritsas

  • Catherine Yeo

    Catherine Yeo

See all (26)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech

Teams