{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Ejemplos Pandas\n", "\n", "\n", "## Base de datos ESRU-EMOVI 2017\n", "\n", "\n", "\n", "Por primera vez y gracias al enfoque territorial de la ESRU – EMOVI 2017, es posible medir la movilidad social por regiones. La encuesta de 2017, al igual que las anteriores, tiene como finalidad principal medir la movilidad social intergeneracional. Cuenta con 17,665 entrevistas y es representativa de hombres y mujeres entre 25 y 64 años a nivel nacional, para la Ciudad de México y cinco regiones del país: norte, norte-occidente, centro, centro-norte y sur. Los objetivos de la ESRU-EMOVI 2017 son:\n", "\n", "- Contar con información actualizada en las distintas dimensiones de la movilidad social a nivel nacional.\n", "\n", "- Generar estimaciones de movilidad para cinco regiones del país y la Ciudad de México.\n", "\n", "- Analizar los patrones de movilidad social desde la perspectiva de la desigualdad de oportunidades.\n", "\n", "Encuesta financiada por la Fundación ESRU. [CEEY](https://ceey.org.mx/contenido/que-hacemos/emovi/) \n", "\n" ] }, { "cell_type": "code", "execution_count": 26, "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import matplotlib.pyplot as plt\n", "import numpy as np\n", "import seaborn as sns\n", "import string" ] }, { "cell_type": "code", "execution_count": 23, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "folio object\n", "Estado float64\n", "folio_ageb object\n", "consecutivo object\n", "Origen float64\n", " ... \n", "cmo2_2 object\n", "cmo3_2 object\n", "cmo4_2 object\n", "cmo5_2 object\n", "tamhog float64\n", "Length: 366, dtype: object" ] }, "execution_count": 23, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df = pd.read_stata('ESRU-EMOVI-2017-Entrevistado.dta',\n", " convert_categoricals= False\n", " )\n", "df.dtypes" ] }, { "cell_type": "code", "execution_count": 25, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", " | folio | \n", "Estado | \n", "folio_ageb | \n", "consecutivo | \n", "Origen | \n", "Latitud | \n", "Longitud | \n", "LatitudGP | \n", "LongitudGP | \n", "recontacto | \n", "... | \n", "region | \n", "cdmx | \n", "tot_int | \n", "rururb | \n", "cmo1_2 | \n", "cmo2_2 | \n", "cmo3_2 | \n", "cmo4_2 | \n", "cmo5_2 | \n", "tamhog | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0100100010286020830102 | \n", "1.0 | \n", "0100100010286 | \n", "1 | \n", "1.0 | \n", "21.901323 | \n", "-102.310598 | \n", "21.901477 | \n", "-102.310429 | \n", "2 | \n", "... | \n", "3.0 | \n", "NaN | \n", "5.0 | \n", "0.0 | \n", "41 | \n", ". | \n", ". | \n", "13 | \n", ". | \n", "5.0 | \n", "
1 | \n", "0100100010286020850201 | \n", "1.0 | \n", "0100100010286 | \n", "1 | \n", "2.0 | \n", "21.901323 | \n", "-102.310598 | \n", "21.900773 | \n", "-102.311138 | \n", "1 | \n", "... | \n", "3.0 | \n", "NaN | \n", "1.0 | \n", "0.0 | \n", "41 | \n", ". | \n", ". | \n", "41 | \n", ". | \n", "1.0 | \n", "
2 | \n", "0100100010286025830201 | \n", "1.0 | \n", "0100100010286 | \n", "1 | \n", "1.0 | \n", "21.900830 | \n", "-102.311818 | \n", "21.900549 | \n", "-102.313361 | \n", "1 | \n", "... | \n", "3.0 | \n", "NaN | \n", "2.0 | \n", "0.0 | \n", "81 | \n", ". | \n", ". | \n", "11 | \n", ". | \n", "2.0 | \n", "
3 | \n", "0100100010286025840101 | \n", "1.0 | \n", "0100100010286 | \n", "1 | \n", "1.0 | \n", "21.901188 | \n", "-102.310700 | \n", "21.900765 | \n", "-102.313144 | \n", "1 | \n", "... | \n", "3.0 | \n", "NaN | \n", "1.0 | \n", "0.0 | \n", "52 | \n", ". | \n", ". | \n", ". | \n", ". | \n", "1.0 | \n", "
4 | \n", "0100100010286025850101 | \n", "1.0 | \n", "0100100010286 | \n", "1 | \n", "2.0 | \n", "21.901188 | \n", "-102.310700 | \n", "21.900577 | \n", "-102.312733 | \n", "1 | \n", "... | \n", "3.0 | \n", "NaN | \n", "2.0 | \n", "0.0 | \n", "52 | \n", ". | \n", ". | \n", ". | \n", ". | \n", "2.0 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
17660 | \n", "3205700010022019460402 | \n", "32.0 | \n", "3205700010022 | \n", "2 | \n", "1.0 | \n", "22.755409 | \n", "-102.513985 | \n", "22.755409 | \n", "-102.513985 | \n", "1 | \n", "... | \n", "2.0 | \n", "NaN | \n", "4.0 | \n", "0.0 | \n", "71 | \n", ". | \n", ". | \n", ". | \n", "71 | \n", "4.0 | \n", "
17661 | \n", "3205700010022025450501 | \n", "32.0 | \n", "3205700010022 | \n", "2 | \n", "1.0 | \n", "22.288405 | \n", "-101.577532 | \n", "22.288405 | \n", "-101.577532 | \n", "1 | \n", "... | \n", "2.0 | \n", "NaN | \n", "4.0 | \n", "0.0 | \n", ". | \n", ". | \n", ". | \n", "82 | \n", "71 | \n", "4.0 | \n", "
17662 | \n", "3205700010022025460301 | \n", "32.0 | \n", "3205700010022 | \n", "2 | \n", "1.0 | \n", "22.758625 | \n", "-102.499375 | \n", "22.758625 | \n", "-102.499375 | \n", "1 | \n", "... | \n", "2.0 | \n", "NaN | \n", "6.0 | \n", "0.0 | \n", ". | \n", ". | \n", ". | \n", "53 | \n", "52 | \n", "6.0 | \n", "
17663 | \n", "3205700010022025460302 | \n", "32.0 | \n", "3205700010022 | \n", "2 | \n", "1.0 | \n", "22.755420 | \n", "-102.513997 | \n", "22.755420 | \n", "-102.513997 | \n", "1 | \n", "... | \n", "2.0 | \n", "NaN | \n", "5.0 | \n", "0.0 | \n", ". | \n", "52 | \n", ". | \n", "53 | \n", "62 | \n", "5.0 | \n", "
17664 | \n", "3205700010022025460501 | \n", "32.0 | \n", "3205700010022 | \n", "7 | \n", "1.0 | \n", "22.758625 | \n", "-102.499375 | \n", "22.758625 | \n", "-102.499375 | \n", "1 | \n", "... | \n", "2.0 | \n", "NaN | \n", "10.0 | \n", "0.0 | \n", ". | \n", ". | \n", ". | \n", ". | \n", "41 | \n", "10.0 | \n", "
17665 rows × 366 columns
\n", "\n", " | Estado | \n", "p05 | \n", "p06 | \n", "p13 | \n", "SINCO3 | \n", "
---|---|---|---|---|---|
count | \n", "3699.0 | \n", "3699.0 | \n", "3699 | \n", "3699.0 | \n", "3699 | \n", "
unique | \n", "32.0 | \n", "40.0 | \n", "2 | \n", "13.0 | \n", "305 | \n", "
top | \n", "9.0 | \n", "40.0 | \n", "1 | \n", "2.0 | \n", "4111 | \n", "
freq | \n", "600.0 | \n", "146.0 | \n", "2254 | \n", "943.0 | \n", "489 | \n", "
\n", " | folioviv | \n", "foliohog | \n", "numren | \n", "clave | \n", "mes_1 | \n", "mes_2 | \n", "mes_3 | \n", "mes_4 | \n", "mes_5 | \n", "mes_6 | \n", "ing_1 | \n", "ing_2 | \n", "ing_3 | \n", "ing_4 | \n", "ing_5 | \n", "ing_6 | \n", "ing_tri | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0100003801 | \n", "1 | \n", "02 | \n", "P009 | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | 7500.0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "1844.26 | \n", "
1 | \n", "0100003801 | \n", "1 | \n", "01 | \n", "P001 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "04 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "53114.75 | \n", "
2 | \n", "0100003801 | \n", "1 | \n", "02 | \n", "P001 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "04 | \n", "15000.0 | \n", "15000.0 | \n", "15000.0 | \n", "15000.0 | \n", "15000.0 | \n", "15000.0 | \n", "44262.29 | \n", "
3 | \n", "0100003801 | \n", "1 | \n", "01 | \n", "P009 | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | 6000.0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "1475.40 | \n", "
4 | \n", "0100003802 | \n", "1 | \n", "02 | \n", "P040 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "5000.0 | \n", "0.0 | \n", "2459.01 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
334332 | \n", "3260801906 | \n", "1 | \n", "02 | \n", "P022 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "0.0 | \n", "0.0 | \n", "2200.0 | \n", "2000.0 | \n", "2000.0 | \n", "2000.0 | \n", "4010.86 | \n", "
334333 | \n", "3260801906 | \n", "1 | \n", "02 | \n", "P053 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "500.0 | \n", "0.0 | \n", "300.0 | \n", "200.0 | \n", "0.0 | \n", "300.0 | \n", "635.86 | \n", "
334334 | \n", "3260801906 | \n", "1 | \n", "04 | \n", "P014 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "1080.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "528.26 | \n", "
334335 | \n", "3260801906 | \n", "1 | \n", "04 | \n", "P001 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "1200.0 | \n", "500.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "0.0 | \n", "831.52 | \n", "
334336 | \n", "3260801906 | \n", "1 | \n", "01 | \n", "P053 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "700.0 | \n", "300.0 | \n", "200.0 | \n", "1200.0 | \n", "200.0 | \n", "200.0 | \n", "1369.56 | \n", "
334337 rows × 17 columns
\n", "\n", " | folioviv | \n", "foliohog | \n", "numren | \n", "clave | \n", "mes_1 | \n", "mes_2 | \n", "mes_3 | \n", "mes_4 | \n", "mes_5 | \n", "mes_6 | \n", "ing_1 | \n", "ing_2 | \n", "ing_3 | \n", "ing_4 | \n", "ing_5 | \n", "ing_6 | \n", "ing_tri | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | \n", "0100003801 | \n", "1 | \n", "01 | \n", "P001 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "04 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "53114.75 | \n", "
3 | \n", "0100003801 | \n", "1 | \n", "01 | \n", "P009 | \n", "\n", " | \n", " | \n", " | \n", " | \n", " | \n", " | 6000.0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "1475.40 | \n", "
8 | \n", "0100003802 | \n", "1 | \n", "01 | \n", "P001 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "47213.11 | \n", "
10 | \n", "0100003803 | \n", "1 | \n", "01 | \n", "P001 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "04 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "82622.95 | \n", "
13 | \n", "0100003804 | \n", "1 | \n", "01 | \n", "P001 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "04 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "29508.19 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
334328 | \n", "3260801904 | \n", "1 | \n", "01 | \n", "P072 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "1750.0 | \n", "1750.0 | \n", "1750.0 | \n", "1750.0 | \n", "1750.0 | \n", "1750.0 | \n", "5135.86 | \n", "
334329 | \n", "3260801905 | \n", "1 | \n", "01 | \n", "P032 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "17608.69 | \n", "
334330 | \n", "3260801905 | \n", "1 | \n", "01 | \n", "P044 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "1100.0 | \n", "0.0 | \n", "1100.0 | \n", "0.0 | \n", "1100.0 | \n", "0.0 | \n", "1614.13 | \n", "
334331 | \n", "3260801906 | \n", "1 | \n", "01 | \n", "P001 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "2200.0 | \n", "2200.0 | \n", "2200.0 | \n", "2000.0 | \n", "2000.0 | \n", "2000.0 | \n", "6163.04 | \n", "
334336 | \n", "3260801906 | \n", "1 | \n", "01 | \n", "P053 | \n", "10 | \n", "09 | \n", "08 | \n", "07 | \n", "06 | \n", "05 | \n", "700.0 | \n", "300.0 | \n", "200.0 | \n", "1200.0 | \n", "200.0 | \n", "200.0 | \n", "1369.56 | \n", "
151177 rows × 17 columns
\n", "\n", " | folioviv | \n", "foliohog | \n", "numren | \n", "ing_1 | \n", "ing_2 | \n", "ing_3 | \n", "ing_4 | \n", "ing_5 | \n", "ing_6 | \n", "ing_tri | \n", "claves | \n", "ing_men | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0100003801 | \n", "1 | \n", "01 | \n", "24000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "54590.15 | \n", "P001P009 | \n", "18196.716667 | \n", "
1 | \n", "0100003802 | \n", "1 | \n", "01 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "47213.11 | \n", "P001 | \n", "15737.703333 | \n", "
2 | \n", "0100003803 | \n", "1 | \n", "01 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "82622.95 | \n", "P001 | \n", "27540.983333 | \n", "
3 | \n", "0100003804 | \n", "1 | \n", "01 | \n", "15000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "30737.69 | \n", "P001P009 | \n", "10245.896667 | \n", "
4 | \n", "0100003805 | \n", "1 | \n", "01 | \n", "40000.0 | \n", "12000.0 | \n", "12000.0 | \n", "12000.0 | \n", "12000.0 | \n", "12000.0 | \n", "42295.07 | \n", "P001P009 | \n", "14098.356667 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
67814 | \n", "3260438625 | \n", "2 | \n", "01 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "11803.27 | \n", "P001 | \n", "3934.423333 | \n", "
67815 | \n", "3260547718 | \n", "2 | \n", "01 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "17704.91 | \n", "P001 | \n", "5901.636667 | \n", "
67816 | \n", "3260547909 | \n", "2 | \n", "01 | \n", "1700.0 | \n", "750.0 | \n", "1700.0 | \n", "3200.0 | \n", "4150.0 | \n", "3200.0 | \n", "7190.21 | \n", "P042P022 | \n", "2396.736667 | \n", "
67817 | \n", "3260601317 | \n", "2 | \n", "01 | \n", "950.0 | \n", "0.0 | \n", "950.0 | \n", "0.0 | \n", "950.0 | \n", "0.0 | \n", "1394.02 | \n", "P042 | \n", "464.673333 | \n", "
67818 | \n", "3260610709 | \n", "2 | \n", "01 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "10622.95 | \n", "P001 | \n", "3540.983333 | \n", "
67819 rows × 12 columns
\n", "\n", " | folioviv | \n", "foliohog | \n", "numren | \n", "ing_1 | \n", "ing_2 | \n", "ing_3 | \n", "ing_4 | \n", "ing_5 | \n", "ing_6 | \n", "ing_tri | \n", "claves | \n", "ing_men | \n", "cohort | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "0100003801 | \n", "1 | \n", "01 | \n", "24000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "18000.0 | \n", "54590.15 | \n", "P001P009 | \n", "18196.716667 | \n", "C5 | \n", "
1 | \n", "0100003802 | \n", "1 | \n", "01 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "16000.0 | \n", "47213.11 | \n", "P001 | \n", "15737.703333 | \n", "C5 | \n", "
2 | \n", "0100003803 | \n", "1 | \n", "01 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "28000.0 | \n", "82622.95 | \n", "P001 | \n", "27540.983333 | \n", "C6 | \n", "
3 | \n", "0100003804 | \n", "1 | \n", "01 | \n", "15000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "10000.0 | \n", "30737.69 | \n", "P001P009 | \n", "10245.896667 | \n", "C4 | \n", "
4 | \n", "0100003805 | \n", "1 | \n", "01 | \n", "40000.0 | \n", "12000.0 | \n", "12000.0 | \n", "12000.0 | \n", "12000.0 | \n", "12000.0 | \n", "42295.07 | \n", "P001P009 | \n", "14098.356667 | \n", "C5 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
67814 | \n", "3260438625 | \n", "2 | \n", "01 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "4000.0 | \n", "11803.27 | \n", "P001 | \n", "3934.423333 | \n", "C2 | \n", "
67815 | \n", "3260547718 | \n", "2 | \n", "01 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "6000.0 | \n", "17704.91 | \n", "P001 | \n", "5901.636667 | \n", "C3 | \n", "
67816 | \n", "3260547909 | \n", "2 | \n", "01 | \n", "1700.0 | \n", "750.0 | \n", "1700.0 | \n", "3200.0 | \n", "4150.0 | \n", "3200.0 | \n", "7190.21 | \n", "P042P022 | \n", "2396.736667 | \n", "C1 | \n", "
67817 | \n", "3260601317 | \n", "2 | \n", "01 | \n", "950.0 | \n", "0.0 | \n", "950.0 | \n", "0.0 | \n", "950.0 | \n", "0.0 | \n", "1394.02 | \n", "P042 | \n", "464.673333 | \n", "C1 | \n", "
67818 | \n", "3260610709 | \n", "2 | \n", "01 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "3600.0 | \n", "10622.95 | \n", "P001 | \n", "3540.983333 | \n", "C2 | \n", "
67819 rows × 13 columns
\n", "