Sanbercode
Python
Data Science
HELP International is an international non-governmental organization committed to fighting poverty and providing basic facilities and assistance to communities in underdeveloped countries during disasters and natural calamities. HELP International has raised around $10 million. The CEO now needs to decide how to use the money strategically and effectively, which means deciding which countries most need aid.
Objective - Categorize countries by the social, economic, and health factors that determine overall country development, as a basis for recommending which countries most urgently need aid from HELP International.
A country's level of welfare is influenced by socio-economic, health, physical and environmental, and legal factors, as well as by the potential of its citizens. Socio-economic and health factors are usually the main considerations in development decisions because they have a large impact that is felt directly by citizens at every level of society.
Feature Selection - The features used as the basis for further analysis and clustering are GDP per capita and life expectancy. GDP per capita indicates the level of prosperity of a country and its people, and life expectancy is a key indicator of a country's health status. No missing values were found, so the dataframe can be processed directly. Outliers are also handled as part of preparing the data for analysis.
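The preparation steps described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the toy data, the column names `gdp_per_capita` and `life_expectancy`, and the helper `cap_outliers_iqr` are all hypothetical, and capping at 1.5 times the interquartile range is only one assumed way of handling outliers, since the text does not say which method was used.

```python
import pandas as pd

# Hypothetical toy data; the column names are assumptions, not the project's schema.
df = pd.DataFrame({
    "country": ["A", "B", "C", "D", "E", "F"],
    "gdp_per_capita": [500, 1200, 3000, 9000, 40000, 120000],
    "life_expectancy": [55.0, 60.0, 68.0, 74.0, 80.0, 82.0],
})

# Keep only the two features chosen for clustering.
features = ["gdp_per_capita", "life_expectancy"]
X = df[features].astype(float)

# Count missing values per feature before going any further.
missing_counts = X.isna().sum()


def cap_outliers_iqr(series, k=1.5):
    """Clip values outside [Q1 - k*IQR, Q3 + k*IQR] to those bounds (assumed method)."""
    q1, q3 = series.quantile(0.25), series.quantile(0.75)
    iqr = q3 - q1
    return series.clip(lower=q1 - k * iqr, upper=q3 + k * iqr)


# Apply the capping column by column.
X_capped = X.apply(cap_outliers_iqr)

print(missing_counts)
print(X_capped)
```

Capping keeps every country in the dataset, which matters here because the goal is to rank all countries for aid rather than to discard extreme cases.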
The figure shows the clustering results. Based on the chosen value of n (the number of clusters), three groups are generated: (1-Red) countries with low GDP per capita and low life expectancy; (2-Olive) countries with low GDP per capita but high life expectancy; (3-Green) countries with high GDP per capita and high life expectancy.
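A three-cluster grouping of this kind could be produced along the following lines. The text does not name the clustering algorithm, so K-Means from scikit-learn is an assumption here, as are the standardization step and the made-up data points; treat this as a sketch of the idea rather than the analysis behind the figure.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Hypothetical (GDP per capita, life expectancy) pairs forming three loose groups.
X = np.array([
    [600, 52], [900, 55], [1200, 57],        # low GDP, low life expectancy
    [2500, 72], [3000, 74], [3500, 73],      # low GDP, high life expectancy
    [35000, 80], [42000, 81], [50000, 82],   # high GDP, high life expectancy
], dtype=float)

# Standardize so that GDP (large numbers) does not dominate the distance metric.
X_scaled = StandardScaler().fit_transform(X)

# Assumed algorithm: K-Means with n = 3 clusters.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_scaled)

# Summarize each cluster in the original units to help interpret it.
for cluster_id in sorted(set(labels)):
    members = X[labels == cluster_id]
    print(cluster_id, members.mean(axis=0))
```

The cluster numbers returned by K-Means are arbitrary, so the per-cluster means are what identify which group corresponds to low GDP and low life expectancy, the group with the most urgent need.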