Here in this post, I am going to show you on how to import IMDb plain text data files available at into your MySQL database server. I was using this data set to perform various benchmark tests around a moderately-large database size. If you want to have a small MySQL database sample data, you can try to use Sakila or World database available. I will be using CentOS 6.3 64bit as the OS platform and presume that MySQL has already installed and running.
Make sure that you are having at least 4 GB of free space in the partition that MySQL @@datadir reside to allow this long-running process (3 to 5 hours – depending on your hardware specs + server workload) to successfully complete. We will be using a Python-based package called. This package requires you to have Python with some development libraries installed as well as. Then, we will need to download all data files from IMDb mirror site:, create the database and start the import process. Installing IMDbPy and SQLObject 1. Install required packages using yum.
Jump to Merging IMDB database topic map layers - Now you should have all IMDB data files extracted. In Terry Gilliam's director table and topic Twelve.
SCANNING movies: Last Sunset (2006) (movieID: 2130001) SCANNING movies: Legend of Hell (2012) (movieID: 2140001) SCANNING movies: Lifestyles of Squirrels (2011) (movieID: 2150001) SCANNING movies: Los signos del tiempo (1983) (movieID: 2160001) SCANNING movies: Madame T (2012) (movieID: 2170001) SCANNING movies: Marijji ringu (2007) (movieID: 2180001) SCANNING movies: Menculik miyabi (2010) (movieID: 2190001). FLUSHING MoviesCache.
SCANNING movies: Last Sunset (2006) (movieID: 2130001) SCANNING movies: Legend of Hell (2012) (movieID: 2140001) SCANNING movies: Lifestyles of Squirrels (2011) (movieID: 2150001) SCANNING movies: Los signos del tiempo (1983) (movieID: 2160001) SCANNING movies: Madame T (2012) (movieID: 2170001) SCANNING movies: Marijji ringu (2007) (movieID: 2180001) SCANNING movies: Menculik miyabi (2010) (movieID: 2190001). FLUSHING MoviesCache. Wait up until it finish and you will have large sample data to play around in your MySQL server!