# Posts

Performing Sequence Labelling using CRF in Python

May 23, 2017  

Sequence Labelling in NLP: In natural language processing, it is a common task to extract words or phrases of particular types from a given sentence or paragraph. For example, when analysing a corpus of news articles, we may want to know which countries are mentioned in the articles, and how many articles relate to each of these countries. This is a special case of sequence labelling in NLP (others include POS tagging and chunking), in which the goal is to assign a label to each member of the sequence.

...More

Matrix Factorization: A Simple Tutorial and Implementation in Python

Apr 23, 2017  

(This is an updated version of the article published on my previous personal Website and quuxlab.) There is probably no need to say that there is too much information on the Web nowadays. Search engines help us a little. Better still is to have something interesting recommended to us automatically, without asking. Indeed, from something as simple as a list of the most popular questions and answers on Quora to the more personalized recommendations we receive on Amazon, we are usually offered recommendations on the Web.

...More

Deploying Jupyter in Ubuntu with Nginx and Supervisor

Mar 21, 2017  

The IPython Notebook, now called Jupyter Notebook, is a convenient and interactive Web application for fast prototyping and testing ideas in Python (and R, Julia, Scala, and others) in the Web browser. Installing it on Ubuntu is easy, but it takes a little more effort to deploy it on a server and have it run as a service. This article serves as a simple guide to deploying Jupyter on an Ubuntu server, using the Nginx Web server and the Supervisor process control system.

...More

Location and Friendship: Data Mining in Facebook

Sep 5, 2010  

In the past, studying social issues such as the mobility of a group of people generally required a huge amount of effort. Questionnaires had to be prepared, distributed, and collected after they were filled in. It was, and still is, a labor-intensive task when face-to-face interviews are required to obtain various personal data. Nowadays, more and more people are connected to the Internet, and many of these Internet users participate in various kinds of social interactions on the Web.

...More