Archive

Archive for April, 2010

Retrieve URLs and Tags from Del.icio.us with Python API on Ubuntu

April 30, 2010 Leave a comment

For DeliciousAPI, please refer to Michael G. Noll web page. My previous post on DeliciousAPI would be useful too.

  1. Install Python on your machine. Most of the time, Ubuntu will come with Python already.
  2. Install “Easy Install” for Python by running terminal and type
    sudo apt-get install python-setuptools python-dev build-essential

    For more details, please refer to SaltyCrane Blog.

  3. Install DeliciousAPI by running
sudo easy_install DeliciousAPI

Now you can use DeliciousAPI on Python already!!! Just go to Michael’s web page then copy the demo code to a txt file, say test.py. Then go to the folder containing the file and run it from the terminal by typing “python test.py”. For more detail about how to install the API on Windows7, please refer to my previous post.

Retrieve URLs and Tags from Del.icio.us (Delicious.com)

April 30, 2010 2 comments

Del.icio.us is a bookmarking website. For some people who want to work on topic modeling, the website can be a good data set to try. In this post, I would like to show how to retrieve the information (e.g. URLs, title, tags, users, comments, timestamps ) from Del.icio.us. There are so many way to do the job, but, for me, I think Python API is a good and easy way to do this.

Actually the whole API is available for free and described very well in Del.icio.us Python API web page by Michael G. Noll. However, some of us who is an absolute beginner may not understand how to install and use the API, so I would like to elaborate Michael’s guide in more detail. There are 4 big steps, here we go!

  1. Install Python: First of all, you will have to have Python engine installed in your machine
  2. Install Easy Install: Easy Install is a Python package that can save us a lot of time when installing any Python package.
  3. Install Michael’s Del.icio.us Python API on your machine
  4. Run the API, and have fun!

1. Install Python on your machine Windows 7 64-bit

  1. Download Python engine from the Python download page. Pick the installer that matches your machine and OS. For me, I have windows7 64-bit, so I will download “Python 2.6.5 Windows X86-64 installer (Windows AMD64 / Intel 64 / X86-64 binary [1] — does not include source)”
  2. Install the file on your machine, it should not take too long to download and install it on your machine. I found a good video tutorial on the Python installation and testing.
  3. Now you can play with Python IDLE to see if you install it properly.
  4. Add path to the Python folder
    1. Run command line as an administrator, please refer to this blog.
    2. add path by typing “set path=C:\Program Files\yourPythonFolder;%path%”. Note that yourPythonFolder MUST contain the file python.exe
    3. You can check if the path is included properly by typing “path”

2. Install Easy Install

For simplicity, we will next install a Python package called “Easy Install” which can save a lot of our time when install additional Python package. Easy Install will monitor the installation and can automatically download and install additional package needed. This way we don’t have to manually check what package to download and install.

  1. Please go to Easy Install web page, and click download the proper installation file. Note that if you use Windows7 64-bit, the only option you may use is the “Source” file (setuptools-0.6c11.tar.gz). One good thing about using the source file is that it always works regardless of what kind of machine or OS you are using…so I will go this way.
  2. Download the source and extract the file on a folder, say C:\Users\bot\Downloads\PythonFiles\setuptools-0.6c11
  3. Run command line as an administrator, and go to the folder C:\Users\bot\Downloads\PythonFiles\setuptools-0.6c11
  4. Now we have to install the Easy Install package by running the command “python setup.py install”. You will see so many things going on in the command line.
  5. You will find that Easy Install would be stored in the folder “C:\Program Files\Python 2.6.5 64-bit\Scripts”

3. Install Michael’s Del.icio.us Python API

We will use Easy Install to do the job

  1. Stay in the command line, go to the folder “C:\Program Files\Python 2.6.5 64-bit\Scripts” by typing “cd C:\Program Files\Python 2.6.5 64-bit\Scripts”
  2. Run the command “easy_install DeliciousAPI“, you will see Python downloading packages necessary for DeliciousAPI.

4. Run DeliciousAPI

Now that we installed the API already, now let’s run it.

  1. Go to Michael’s page, copy the demo code, save it as “test.py” and put it in any folder you want.
  2. You can run the test.py from command line by typing “python test.py”. You should be able to see the URLs, tags, everything pop up on your command line panel!

I would like to thank my friend Rohit Manokaran for helping me with this DeliciousAPI.

Topics in Computer Vision

April 24, 2010 Leave a comment

Good overview 5-hour video lecture for computer vision [link]

Learning in Computer Vision

Simon Lucey

7 videos

There are some topics that are often referred to in computer vision publications. Here is my attempt to compile a list of good resources and tutorials:

  • SIFT: there are some good tutorial websites: [link], good tutorial paper [link]
  • SURF: link to the official website [link], the revised paper [link], wiki [link]
  • Bags of keypoints [link]
  • The Pyramid Match Kernel [link]
  • Automatic image annotation

LyX – “No information for exporting the format PDF”

April 21, 2010 1 comment

If you encounter the error message “No information for exporting the format PDF (or PS or DVI)from LyX, you may check the followings:

  1. Check if MikTeX is installed in your machine already
  2. Make sure that LyX links to the correct folder in the MikTeX which is “C:\Program Files\MiKTeX 2.8\miktex\bin”
  3. One easy way is to uninstall and reinstall LyX
  4. If it still does not work, then go to the main menu Tools>Reconfigure. This should work, but might take some time.
Categories: Tutorials Tags:

Speech Processing and Machine Recognition of Speech

April 19, 2010 Leave a comment

Speech processing has been an active area of research for decades. By nature, the research is highly multidisciplinary as it involves linguistics, psychology, anatomy, mathematics, and machine learning, so some newbies on the area might wonder what would be a good tutorial to read in order to catch up with the topic. Personally, without any clue about linguistics or psychology, I started with a classic HMMs paper [1] by Rabiner. However, this paper focuses on HMM part of speech recognition rather than the overview of speech processing.  Today I found a series of good video lectures, and I thought that it might help summarize what people have been doing in the area. Hopefully you find it useful.

  • Hidden Markov Model (HMM) Toolbox for Matlab by Kevin Murphy [link]: The web page links to many useful papers, and provides HMM toolbox for MATLAB.
  • JHU Summer School on Human Language Technology [link]: The web page contains a lot of video lectures hosted by videolectures.net
  • Here is the lecture by Hynek Hermansky, you may want to watch this first

Introduction to Speech Processing

Hynek Hermansky

Next, you may want to know more technical details

Machine Recognition of Speech

Brian Kingsbury

Recommended Reading:

[1] A tutorial on Hidden Markov Models and selected applications in speech recognition, L. Rabiner, 1989, Proc. IEEE 77(2):257–286. [pdf]