The theme of our project is a global dynamic migration flow. Besides for immigration itself, we also hope to bring the elements that drive people to migrate into consideration. The features we chose respectively are economic, social security situation and food security. In the end, we decided on four datasets in total. Our primary dataset is the migration dataset which the United Nation collects. The GDP dataset from the World Bank for economic factors, the crime/safety index dataset from the numbeo, and the global food security index dataset from the economist are chosen as complementary datasets.
The migration dataset contains 156,342 data points and five dimensions: origin, destination, year, number of female migration, and male migration. Data were collected at seven time points: 1990, 1995, 2000, 2005, 2010, 2015, and 2019. The data of 2015 and 2019 are discussed here as an example.
The mean of male immigration and female immigration in 2015 and 2019 are respectively illustrated below. We can observe that there is less male migration than female migration both in 2015 and 2019. Both the number of male migration and female migration decreased between 2015 and 2019. The decline of male migration is more severe than female migration.
The GDP dataset contains 12,398 data points. After filtering the data by year we hope to discuss (1990, 1995, 2000, 2005, 2010, 2015, and 2019) there are 1855 data points and three dimensions: country, year, and GDP.
The histogram of GDP per capita of 2015 and 2019 are respectively illustrated below. We can observe that most countries have a GDP lower than 25,000 dollars.
The crime rate dataset contains 1888 data points and four dimensions: country, year, crime index, and safety index.
The means of crime indices and safety indices of 2015 and 2019 are displayed below. We observe that the social security condition improves from 2015 to 2019.
To further discuss the crime index and safety index, the boxplots of crime indices and safety indices of 2015 and 2019 are displayed below.
The global food security index dataset contains 3955 data points and seven dimensions: country, year, overall score, affordability, availability, quality and safety, natural resources and resilience. We will mainly focus on quality and safety.
The histograms of food security in 2015 and 2019 are illustrated below. We can observe that they are basically the same. So, the food security conditions of 2015 and 2019 basically remain the same. But there might be some changes for each country which can not be observed with these general figures.