T3: ENDIREH 2021

Previo

Descarga el proyecto desde acá

En esta liga puedes descarga el proyecto de trabajo. De esta manera no tendremos problemas con las rutas relativas.

Video de la sesión

Paquetes

if (!require("pacman")) install.packages("pacman") # instala pacman si se requiere

Loading required package: pacman

pacman::p_load(tidyverse,
               skimr,
               haven, 
               readr,
               foreign,
               janitor,
               magrittr,
               pollster,
               srvyr,
               sjlabelled) #carga los paquetes necesarios

En esta práctica trabajaremos de nuevo con los datos abiertos.

¿Qué son datos abiertos?

https://publications.iadb.org/publications/spanish/viewer/Los-datos-abiertos-en-América-Latina-y-el-Caribe.pdf

Introducción a la fuente

De acuerdo a la información en los datos abiertos, hay 28 conjuntos de datos, uno a nivel vivienda y el resto a nivel individuo:

Vivienda: tviv
Individual: tsdem - toda la población
Individual - persona elegida: desde la sección III

Es muy raro que analicemos TODA la base de datos. Trabajaremos con el fusionado de vivienda, demográfico, sección III y sección de violencia en al ámbito laboral.

Pero… vamos a automatizar el proceso. Para eso primero repasemos sobre funciones y bucles

Mi primera función

Unos de los elementos más poderosos de R es hacer nuestra propias funciones.

La lógica de las funciones es la siguiente:

nombre_de_funcion(argumento1, argumento2, ...) {
  operaciones
  salida
}

Para ello haremos una función sencilla. Para sumarle un valor un 1

mi_funcion<-function(x) {
    resultado<-x+1
    return(resultado)
}

mi_funcion(5)

[1] 6

Vamos a agregar un argumento, podemos agregar un segundo número en lugar de 1

mi_funcion<-function(x, a) {
    resultado<-x+a
    return(resultado)
}

mi_funcion(x=5, a=6)

[1] 11

Los argumentos no necesariamente deben ser un sólo valor

mi_funcion(x=1:5, a=6:10)

[1]  7  9 11 13 15

Bucles

`for()`

Supongamos que quisiéramos repetir una operación a lo largo de una secuencia, se puede realizar

for (i in secuencia) {
  operación 1
  operación 2
  ...
  operación final
}

for(i in 1:10) {
  print(i+1)
}

[1] 2
[1] 3
[1] 4
[1] 5
[1] 6
[1] 7
[1] 8
[1] 9
[1] 10
[1] 11

for(j in c("Hugo", "Paco", "Luis")) {
  
  x<-paste("hola",j, sep=" ")
  print(x)
}

[1] "hola Hugo"
[1] "hola Paco"
[1] "hola Luis"

Importación de los datos

La estructura de archivos es muy consistente en los datos abiertos. Vamos a utilizar además una función llamada “paste()” que ayuda a pegar cadenas, que la había mostrado arriba

a<-"Hola"

b<-"¿Cómo estás?"

paste(a, b, sep=" ")

[1] "Hola ¿Cómo estás?"

Esto será muy útil para crear nuestra función

Por ejemplo revisemos la liga para importar los datos de la tabla “TVIV”

TVIV <- read_csv("data_t3/conjunto_de_datos_endireh_2021_csv/conjunto_de_datos_TVIV/conjunto_de_datos/conjunto_de_datos_TVIV.csv")

Rows: 122646 Columns: 35
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (16): ID_VIV, UPM, VIV_SEL, CVE_ENT, NOM_ENT, CVE_MUN, NOM_MUN, COD_RES,...
dbl (19): P1_1, P1_4_1, P1_4_2, P1_4_3, P1_4_4, P1_4_5, P1_4_6, P1_4_7, P1_4...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Hay dos espacios donde va el nombre de la tabla

cd0 <- "data_t3/conjunto_de_datos_endireh_2021_csv/conjunto_de_datos_"
cd1 <- "/conjunto_de_datos/conjunto_de_datos_"

Ejemplo de cómo funciona paste() y paste0()

paste0(cd0,"TVIV", cd1, "TVIV", ".csv")

[1] "data_t3/conjunto_de_datos_endireh_2021_csv/conjunto_de_datos_TVIV/conjunto_de_datos/conjunto_de_datos_TVIV.csv"

Podemos crear una función de importación:

importar <- function(tabla){

  path <- paste0(cd0,tabla, cd1,tabla, ".csv")
  
  x<-readr::read_csv(path, locale = locale(encoding = "latin1"))
  
  return(x)
}

TVIV<-importar("TVIV")

Rows: 122646 Columns: 35
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (16): ID_VIV, UPM, VIV_SEL, CVE_ENT, NOM_ENT, CVE_MUN, NOM_MUN, COD_RES,...
dbl (19): P1_1, P1_4_1, P1_4_2, P1_4_3, P1_4_4, P1_4_5, P1_4_6, P1_4_7, P1_4...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Ojo esta función también depende de cd0 y cd1, por lo que habría que declararlo antes. O bien, incluye los objetos dentro de la función, como lo hicimos con path. Nota que ese objeto no está en nuestro ambiente.

Vamos a usar el el índice de las tablas para importar todas con un loop

indice_tablas <- read_csv("data_t3/conjunto_de_datos_endireh_2021_csv/0_indice_tablas_ENDIREH_2021.csv",locale = locale(encoding = "latin1"), skip = 1) %>%
  clean_names()

Rows: 28 Columns: 2
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): Nombre de archivo, Título de Tablas

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Revisemos este índice

head(indice_tablas)

# A tibble: 6 × 2
  nombre_de_archivo            titulo_de_tablas                                 
  <chr>                        <chr>                                            
1 conjunto_de_datos_TVIV       Características de la Vivienda y Hogares en la V…
2 conjunto_de_datos_TSDem      Características Sociodemográficas de Residentes …
3 conjunto_de_datos_TB_SEC_III Elegibilidad y Verificación de Situación Conyuga…
4 conjunto_de_datos_TB_SEC_IV  Situación de la Relación de Pareja/ Ingresos y R…
5 conjunto_de_datos_TB_SEC_V   Consentimiento y Privacidad                      
6 conjunto_de_datos_TB_SEC_VI  Opinión Sobre los Roles Masculinos y Femeninos

indice_tablas %<>%
  mutate(nombre_de_archivo=stringr::str_remove_all(nombre_de_archivo, "conjunto_de_datos_"))
head(indice_tablas)

# A tibble: 6 × 2
  nombre_de_archivo titulo_de_tablas                                            
  <chr>             <chr>                                                       
1 TVIV              Características de la Vivienda y Hogares en la Vivienda     
2 TSDem             Características Sociodemográficas de Residentes de la Vivie…
3 TB_SEC_III        Elegibilidad y Verificación de Situación Conyugal de la Muj…
4 TB_SEC_IV         Situación de la Relación de Pareja/ Ingresos y Recursos     
5 TB_SEC_V          Consentimiento y Privacidad                                 
6 TB_SEC_VI         Opinión Sobre los Roles Masculinos y Femeninos

Vamos a importar todos los conjuntos!

Creamos un vector con las tablas que queremos importar:

tablas<-indice_tablas$nombre_de_archivo[c(1:4, 9)]

Si le quitas lo que hay entre corchetes, tendrías los 28 conjuntos de datos.

for(i in tablas) {
  
  y<-importar(i) # se importa la base
  assign(paste(i), y) # asigna el nombre al objeto y
}

Rows: 122646 Columns: 35
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (16): ID_VIV, UPM, VIV_SEL, CVE_ENT, NOM_ENT, CVE_MUN, NOM_MUN, COD_RES,...
dbl (19): P1_1, P1_4_1, P1_4_2, P1_4_3, P1_4_4, P1_4_5, P1_4_6, P1_4_7, P1_4...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 432746 Columns: 37
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (21): ID_VIV, ID_PER, UPM, VIV_SEL, CVE_ENT, NOM_ENT, CVE_MUN, NOM_MUN, ...
dbl (16): HOGAR, SEXO, GRA, P2_8, P2_9, P2_10, P2_11, P2_12, P2_13, P2_15, P...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 110127 Columns: 25
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (14): ID_VIV, ID_PER, UPM, VIV_SEL, CVE_ENT, NOM_ENT, CVE_MUN, NOM_MUN, ...
dbl (11): HOGAR, P3_1, P3_2, P3_3, P3_4, P3_5, P3_6, P3_7, FAC_VIV, FAC_MUJ,...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Warning: One or more parsing issues, call `problems()` on your data frame for details,
e.g.:
  dat <- vroom(...)
  problems(dat)

Rows: 110127 Columns: 75
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (34): ID_VIV, ID_PER, UPM, VIV_SEL, N_REN, DOMINIO, CVE_ENT, NOM_ENT, CV...
dbl (39): HOGAR, P4AB_1, P4A_1, P4A_2, P4B_1, P4B_2, P4BC_1, P4C_1, P4BC_3, ...
lgl  (2): P4_10_2_3, P4_10_3_3

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Warning: One or more parsing issues, call `problems()` on your data frame for details,
e.g.:
  dat <- vroom(...)
  problems(dat)

Rows: 110127 Columns: 241
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr  (15): ID_VIV, ID_PER, UPM, VIV_SEL, N_REN, DOMINIO, CVE_ENT, NOM_ENT, C...
dbl (144): HOGAR, P8_1, P8_2, P8_3_1_1, P8_3_1_2, P8_3_2_1, P8_3_2_2, P8_3_2...
lgl  (82): P8_12_1_3, P8_10_2_2, P8_10_2_3, P8_12_2_1, P8_12_2_2, P8_12_2_3,...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Piensa que podríamos tener más argumentos como: el nombre de la base, la carpeta, etc.

Importación de los diccionarios

Los diccionarios tienen una estructura parecida. Reformulemos nuestros directorios para hacer una función más general

cd2 <- "data_t3/conjunto_de_datos_endireh_2021_csv/conjunto_de_datos_"

importar2 <-function(tabla, elemento) {
  
  dir<-paste0(cd2,tabla,"/",elemento,"/",elemento, "_",tabla, ".csv")
  
  x<-readr::read_csv(paste0(dir), locale = locale(encoding = "latin1"))
  
  return(x)
}

TB_SEC_III<-importar2(tabla="TB_SEC_III",
                     elemento="conjunto_de_datos")

Rows: 110127 Columns: 25
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (14): ID_VIV, ID_PER, UPM, VIV_SEL, CVE_ENT, NOM_ENT, CVE_MUN, NOM_MUN, ...
dbl (11): HOGAR, P3_1, P3_2, P3_3, P3_4, P3_5, P3_6, P3_7, FAC_VIV, FAC_MUJ,...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

DIC_TB_SEC_III<-importar2(tabla="TB_SEC_III",
                     elemento="diccionario_de_datos")

Rows: 64 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Loop para importar diccionarios

Creamos un vector con las tablas que queremos importar:

tablas<-indice_tablas$nombre_de_archivo[c(1:4, 9)]

for(i in tablas) {
  
  y<-importar2(tabla=i, 
               elemento="diccionario_de_datos") # se importa la base

  assign(paste0("DICC_",i), y) # asigna el nombre al objeto y
}

Rows: 68 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 107 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 64 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 427 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 1488 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Cuidado con estos diccionarios. Hay varios instrumentos entonces se repiten.

for(i in tablas) {
  
  y<-importar2(tabla=i, 
               elemento="diccionario_de_datos") %>% # se importa la base
    select(NOMBRE_CAMPO:TIPO) %>% 
    unique()

  assign(paste0("DICC_",i), y) # asigna el nombre al objeto y
}

Rows: 68 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 107 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 64 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 427 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
Rows: 1488 Columns: 5
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (4): NOMBRE_CAMPO, NEMONICO, TIPO, RANGO_CLAVES
dbl (1): LONGITUD

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Etiquetado en bucle de variables

Hoy vamos a etiquetar las variables con un bucle.

Aquí solo lo he podido resolver con base

DICC_TB_SEC_III$NEMONICO

 [1] "ID_VIV"    "ID_PER"    "UPM"       "VIV_SEL"   "CVE_ENT"   "NOM_ENT"  
 [7] "CVE_MUN"   "NOM_MUN"   "HOGAR"     "T_INSTRUM" "N_REN"     "P3_1"     
[13] "P3_2"      "P3_3"      "P3_4"      "P3_5"      "P3_6"      "P3_7"     
[19] "P3_8"      "FAC_VIV"   "FAC_MUJ"   "DOMINIO"   "EST_DIS"   "UPM_DIS"  
[25] "ESTRATO"

TB_SEC_III[["ID_VIV"]] %>% head()

[1] "0100003.01" "0100003.02" "0100003.03" "0100003.05" "0100004.01"
[6] "0100004.02"

DICC_TB_SEC_III[DICC_TB_SEC_III$NEMONICO=="ID_VIV",]$NOMBRE_CAMPO

[1] "Identificador de vivienda seleccionada"

for (i in DICC_TB_SEC_III$NEMONICO) {
  
  TB_SEC_III[[i]]<-sjlabelled::set_label(TB_SEC_III[[i]], label=DICC_TB_SEC_III[DICC_TB_SEC_III$NEMONICO==i,]$NOMBRE_CAMPO)
  
}

for (i in DICC_TB_SEC_IV$NEMONICO) {
  
  TB_SEC_IV[[i]]<-sjlabelled::set_label(TB_SEC_IV[[i]], label=DICC_TB_SEC_IV[DICC_TB_SEC_IV$NEMONICO==i,]$NOMBRE_CAMPO)
  
}

Importación de catalagos

Aquí hay una pequeña complicación, hay un archivo por preguntar. Para esto nos servirán las funciones de dir()

dir("data_t3")

[1] "conjunto_de_datos_endireh_2021_csv" "endireh2021_fd.pdf"

Lista los archivos que hay. Si no tienen extensión son carpetas.

cd3<-paste0(cd2,"TB_SEC_III","/catalogos")
dir(cd3)

 [1] "CVE_ENT.csv"   "CVE_MUN.csv"   "DOMINIO.csv"   "ESTRATO.csv"  
 [5] "EST_DIS.csv"   "FAC_MUJ.csv"   "FAC_VIV.csv"   "HOGAR.csv"    
 [9] "ID_PER.csv"    "ID_VIV.csv"    "NOM_ENT.csv"   "NOM_MUN.csv"  
[13] "N_REN.csv"     "P3_1.csv"      "P3_2.csv"      "P3_3.csv"     
[17] "P3_4.csv"      "P3_5.csv"      "P3_6.csv"      "P3_7.csv"     
[21] "P3_8.csv"      "T_INSTRUM.csv" "UPM.csv"       "UPM_DIS.csv"  
[25] "VIV_SEL.csv"

Podemos guardar esa lista como un objeto caracter

cat_TB_SEC_III<-dir(cd3) %>% stringr::str_remove_all(".csv")

Etiquetado de valores

Vamos a etiquetar, ojo las variables de “id” no se etiquetan en valores.

vars<-cat_TB_SEC_III[c(1:8, 14:25)]

for (i in vars) {
  
  x <- readr::read_csv(paste0(cd3,"/",i,".csv"),
                locale = locale(encoding = "latin1")) %>% unique()
  
  TB_SEC_III[[i]]<-as.numeric(TB_SEC_III[[i]])
  TB_SEC_III[[i]]<-set_labels(TB_SEC_III[[i]], labels=x$descrip)
}

Rows: 1 Columns: 2
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): CVE_ENT, descrip

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
More values in "x" than length of "labels". Additional values were added to labels.

Rows: 1 Columns: 2
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): CVE_MUN, descrip

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
More values in "x" than length of "labels". Additional values were added to labels.

Rows: 5 Columns: 2
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): DOMINIO, descrip

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Warning: NAs introducidos por coerción

More labels than values of "x". Using first 0 labels.
Rows: 4 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): ESTRATO, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.Rows: 1 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): EST_DIS, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.
Rows: 1 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): FAC_MUJ, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.
Rows: 1 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): FAC_VIV, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.
Rows: 1 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): HOGAR, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.
Rows: 6 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (1): descrip
dbl (1): P3_1
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.Rows: 3 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_2, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More labels than values of "x". Using first 2 labels.
Rows: 8 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_3, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More labels than values of "x". Using first 7 labels.
Rows: 3 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_4, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More labels than values of "x". Using first 2 labels.
Rows: 3 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_5, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More labels than values of "x". Using first 2 labels.
Rows: 5 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_6, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More labels than values of "x". Using first 4 labels.
Rows: 3 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_7, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More labels than values of "x". Using first 2 labels.
Rows: 6 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): P3_8, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Warning: NAs introducidos por coerción

More labels than values of "x". Using first 0 labels.
Rows: 6 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): T_INSTRUM, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.

Warning: NAs introducidos por coerción

More labels than values of "x". Using first 0 labels.
Rows: 1 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): UPM, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.
Rows: 5 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): UPM_DIS, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.
Rows: 1 Columns: 2── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (2): VIV_SEL, descrip
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.More values in "x" than length of "labels". Additional values were added to labels.

Ejercicio, haz lo mismo para la sección IV

cd3<-paste0(cd2,"TB_SEC_IV","/catalogos")
dir(cd3)

 [1] "CVE_ENT.csv"   "CVE_MUN.csv"   "DOMINIO.csv"   "ESTRATO.csv"  
 [5] "EST_DIS.csv"   "FAC_MUJ.csv"   "FAC_VIV.csv"   "HOGAR.csv"    
 [9] "ID_PER.csv"    "ID_VIV.csv"    "NOM_ENT.csv"   "NOM_MUN.csv"  
[13] "N_REN.csv"     "N_REN_ESP.csv" "P4AB_1.csv"    "P4AB_2.csv"   
[17] "P4A_1.csv"     "P4A_2.csv"     "P4BC_1.csv"    "P4BC_2.csv"   
[21] "P4BC_3.csv"    "P4BC_4.csv"    "P4BC_5.csv"    "P4B_1.csv"    
[25] "P4B_2.csv"     "P4C_1.csv"     "P4_1.csv"      "P4_10_2_1.csv"
[29] "P4_10_2_2.csv" "P4_10_2_3.csv" "P4_10_3_1.csv" "P4_10_3_2.csv"
[33] "P4_10_3_3.csv" "P4_11.csv"     "P4_12_1.csv"   "P4_12_2.csv"  
[37] "P4_12_3.csv"   "P4_12_4.csv"   "P4_12_5.csv"   "P4_12_6.csv"  
[41] "P4_12_7.csv"   "P4_13_1.csv"   "P4_13_2.csv"   "P4_13_3.csv"  
[45] "P4_13_4.csv"   "P4_13_5.csv"   "P4_13_6.csv"   "P4_13_7.csv"  
[49] "P4_2.csv"      "P4_2_1.csv"    "P4_3.csv"      "P4_4.csv"     
[53] "P4_4_CVE.csv"  "P4_5_1_AB.csv" "P4_5_AB.csv"   "P4_6_AB.csv"  
[57] "P4_7_AB.csv"   "P4_8_1.csv"    "P4_8_2.csv"    "P4_8_3.csv"   
[61] "P4_8_4.csv"    "P4_8_5.csv"    "P4_8_6.csv"    "P4_8_7.csv"   
[65] "P4_9_1.csv"    "P4_9_2.csv"    "P4_9_3.csv"    "P4_9_4.csv"   
[69] "P4_9_5.csv"    "P4_9_6.csv"    "P4_9_7.csv"    "T_INSTRUM.csv"
[73] "UPM.csv"       "UPM_DIS.csv"   "VIV_SEL.csv"

DICC_TB_SEC_IV %>% 
  filter(TIPO=="Numérico") %>% 
  select(NEMONICO) -> vars

# for (i in vars$NEMONICO) {
#   
#   x <- read_csv(paste0(cd3,"/",i,".csv"),
#                 locale = locale(encoding = "latin1")) %>% unique() 
#   
#   TB_SEC_IV[[i]]<-as.numeric(TB_SEC_IV[[i]])
#   
#   TB_SEC_IV[[i]]<-set_labels(TB_SEC_IV[[i]], labels=x$descrip)
# }

Pueden haber errores pero pues, ahorra mucho

TB_SEC_III %>% 
  mutate(P3_1=as_label(P3_1)) %>% 
  tabyl(P3_1)

                 P3_1     n    percent
 vive en unión libre? 23597 0.21427080
       está separada?  8628 0.07834591
     está divorciada?  3173 0.02881219
            es viuda?  9750 0.08853415
         está casada? 44977 0.40841029
        está soltera? 20002 0.18162667

No siempre funciona… o funcional 100

TB_SEC_IV %>% 
  mutate(P4AB_1=as_label(P4AB_1)) %>% 
  tabyl(P4AB_1)

 P4AB_1     n   percent valid_percent
      1 31398 0.2851072    0.34620855
      2  2115 0.0192051    0.02332095
      3 36019 0.3270678    0.39716179
      4 21159 0.1921327    0.23330871
     NA 19436 0.1764871            NA

Lo ideal en este caso es etiquetar antes de fusionar.

Vamos a botar algunos objeto que ya no ocuparemos

Checa con glipmse()

glimpse(TB_SEC_III$P3_1) # perdimos la label

 num [1:110127] 5 5 4 6 2 4 2 4 5 5 ...
 - attr(*, "labels")= Named num [1:6] 1 2 3 4 5 6
  ..- attr(*, "names")= chr [1:6] "vive en unión libre?" "está separada?" "está divorciada?" "es viuda?" ...

rm(list=ls(pattern="^DICC_"))
rm(list=ls(pattern="^cat_"))

Fusionado

Para que se guarden las etiquetas debemos usar los “join” de {dplyr}

Además esta base no tiene una sólo variable del id. Tiene un identificador compuesto. Podemos hacer un objeto tipo vector

endireh2021_ind<-TB_SEC_III %>% 
  left_join(TB_SEC_IV, by="ID_PER") %>% 
  select(-ends_with(".y")) %>% # quita todas las variables que terminan en .y
  rename_with(~ stringr::str_remove(.x,  pattern = ".x"),  ends_with(".x")) %>% 
  left_join(TB_SEC_VIII, by="ID_PER") %>% 
  select(-ends_with(".y")) %>% # quita todas las variables que terminan en .y
  rename_with(~ stringr::str_remove(.x,  pattern = ".x"),  ends_with(".x")) 

endireh2021_ind %>% names()

  [1] "ID_VIV"     "ID_PER"     "UPM"        "VIV_SEL"    "CVE_ENT"   
  [6] "NOM_ENT"    "CVE_MUN"    "NOM_MUN"    "HOGAR"      "T_INSTRUM" 
 [11] "N_REN"      "P3_1"       "P3_2"       "P3_3"       "P3_4"      
 [16] "P3_5"       "P3_6"       "P3_7"       "P3_8"       "FAC_VIV"   
 [21] "FAC_MUJ"    "DOMINIO"    "ESTRATO"    "EST_DIS"    "UPM_DIS"   
 [26] "N_REN_ESP"  "P4AB_1"     "P4AB_2"     "P4A_1"      "P4A_2"     
 [31] "P4B_1"      "P4B_2"      "P4BC_1"     "P4BC_2"     "P4C_1"     
 [36] "P4BC_3"     "P4BC_4"     "P4BC_5"     "P4_1"       "P4_2"      
 [41] "P4_2_1"     "P4_3"       "P4_4"       "P4_4_CVE"   "P4_5_AB"   
 [46] "P4_5_1_AB"  "P4_6_AB"    "P4_7_AB"    "P4_8_1"     "P4_8_2"    
 [51] "P4_8_3"     "P4_8_4"     "P4_8_5"     "P4_8_6"     "P4_8_7"    
 [56] "P4_9_1"     "P4_9_2"     "P4_10_2_1"  "P4_10_2_2"  "P4_10_2_3" 
 [61] "P4_9_3"     "P4_10_3_1"  "P4_10_3_2"  "P4_10_3_3"  "P4_9_4"    
 [66] "P4_9_5"     "P4_9_6"     "P4_9_7"     "P4_11"      "P4_12_1"   
 [71] "P4_12_2"    "P4_12_3"    "P4_12_4"    "P4_12_5"    "P4_12_6"   
 [76] "P4_12_7"    "P4_13_1"    "P4_13_2"    "P4_13_3"    "P4_13_4"   
 [81] "P4_13_5"    "P4_13_6"    "P4_13_7"    "P8_1"       "P8_2"      
 [86] "P8_3_1_1"   "P8_3_1_2"   "P8_3_2_1"   "P8_3_2_2"   "P8_3_2_3"  
 [91] "P8_4"       "P8_5"       "P8_6"       "P8_6_CVE"   "P8_7"      
 [96] "P8_8_1"     "P8_8_2"     "P8_8_3"     "P8_8_4"     "P8_8_5"    
[101] "P8_8_6"     "P8_8_7"     "P8_8_8"     "P8_8_9"     "P8_9_1"    
[106] "P8_9_2"     "P8_9_3"     "P8_9_4"     "P8_9_5"     "P8_9_6"    
[111] "P8_9_7"     "P8_9_8"     "P8_9_9"     "P8_9_10"    "P8_9_11"   
[116] "P8_9_12"    "P8_9_13"    "P8_9_14"    "P8_9_15"    "P8_9_16"   
[121] "P8_9_17"    "P8_9_18"    "P8_9_19"    "P8_10_1_1"  "P8_10_1_2" 
[126] "P8_10_1_3"  "P8_11_1"    "P8_12_1_1"  "P8_12_1_2"  "P8_12_1_3" 
[131] "P8_10_2_1"  "P8_10_2_2"  "P8_10_2_3"  "P8_11_2"    "P8_12_2_1" 
[136] "P8_12_2_2"  "P8_12_2_3"  "P8_10_3_1"  "P8_10_3_2"  "P8_10_3_3" 
[141] "P8_11_3"    "P8_12_3_1"  "P8_12_3_2"  "P8_12_3_3"  "P8_13_3_1" 
[146] "P8_13_3_2"  "P8_13_3_3"  "P8_10_4_1"  "P8_10_4_2"  "P8_10_4_3" 
[151] "P8_11_4"    "P8_12_4_1"  "P8_12_4_2"  "P8_12_4_3"  "P8_13_4_1" 
[156] "P8_13_4_2"  "P8_13_4_3"  "P8_10_5_1"  "P8_10_5_2"  "P8_10_5_3" 
[161] "P8_11_5"    "P8_12_5_1"  "P8_12_5_2"  "P8_12_5_3"  "P8_13_5_1" 
[166] "P8_13_5_2"  "P8_13_5_3"  "P8_10_6_1"  "P8_10_6_2"  "P8_10_6_3" 
[171] "P8_11_6"    "P8_12_6_1"  "P8_12_6_2"  "P8_12_6_3"  "P8_13_6_1" 
[176] "P8_13_6_2"  "P8_13_6_3"  "P8_10_7_1"  "P8_10_7_2"  "P8_10_7_3" 
[181] "P8_11_7"    "P8_12_7_1"  "P8_12_7_2"  "P8_12_7_3"  "P8_13_7_1" 
[186] "P8_13_7_2"  "P8_13_7_3"  "P8_10_8_1"  "P8_10_8_2"  "P8_10_8_3" 
[191] "P8_11_8"    "P8_12_8_1"  "P8_12_8_2"  "P8_12_8_3"  "P8_13_8_1" 
[196] "P8_13_8_2"  "P8_13_8_3"  "P8_10_9_1"  "P8_10_9_2"  "P8_10_9_3" 
[201] "P8_11_9"    "P8_12_9_1"  "P8_12_9_2"  "P8_12_9_3"  "P8_13_9_1" 
[206] "P8_13_9_2"  "P8_13_9_3"  "P8_10_10_1" "P8_10_10_2" "P8_10_10_3"
[211] "P8_11_10"   "P8_12_10_1" "P8_12_10_2" "P8_12_10_3" "P8_13_10_1"
[216] "P8_13_10_2" "P8_13_10_3" "P8_10_11_1" "P8_10_11_2" "P8_10_11_3"
[221] "P8_11_11"   "P8_12_11_1" "P8_12_11_2" "P8_12_11_3" "P8_13_11_1"
[226] "P8_13_11_2" "P8_13_11_3" "P8_10_12_1" "P8_10_12_2" "P8_10_12_3"
[231] "P8_11_12"   "P8_12_12_1" "P8_12_12_2" "P8_12_12_3" "P8_13_12_1"
[236] "P8_13_12_2" "P8_13_12_3" "P8_10_13_1" "P8_10_13_2" "P8_10_13_3"
[241] "P8_11_13"   "P8_12_13_1" "P8_12_13_2" "P8_12_13_3" "P8_13_13_1"
[246] "P8_13_13_2" "P8_13_13_3" "P8_10_14_1" "P8_10_14_2" "P8_10_14_3"
[251] "P8_11_14"   "P8_12_14_1" "P8_12_14_2" "P8_12_14_3" "P8_13_14_1"
[256] "P8_13_14_2" "P8_13_14_3" "P8_10_15_1" "P8_10_15_2" "P8_10_15_3"
[261] "P8_11_15"   "P8_12_15_1" "P8_12_15_2" "P8_12_15_3" "P8_13_15_1"
[266] "P8_13_15_2" "P8_13_15_3" "P8_10_16_1" "P8_10_16_2" "P8_10_16_3"
[271] "P8_11_16"   "P8_12_16_1" "P8_12_16_2" "P8_12_16_3" "P8_13_16_1"
[276] "P8_13_16_2" "P8_13_16_3" "P8_10_17_1" "P8_10_17_2" "P8_10_17_3"
[281] "P8_11_17"   "P8_12_17_1" "P8_12_17_2" "P8_12_17_3" "P8_13_17_1"
[286] "P8_13_17_2" "P8_13_17_3" "P8_10_18_1" "P8_10_18_2" "P8_10_18_3"
[291] "P8_11_18"   "P8_12_18_1" "P8_12_18_2" "P8_12_18_3" "P8_13_18_1"
[296] "P8_13_18_2" "P8_13_18_3" "P8_10_19_1" "P8_10_19_2" "P8_10_19_3"
[301] "P8_11_19"   "P8_12_19_1" "P8_12_19_2" "P8_12_19_3" "P8_13_19_1"
[306] "P8_13_19_2" "P8_13_19_3"

Hoy pegamos estas 110,127 mujeres al sociodemográfico

endireh2021<-endireh2021_ind %>% 
  right_join(TSDem, by="ID_PER") %>%  # ojo con el right ¿por qué?
  select(-ends_with(".x")) %>% # quita todas las variables que terminan en .x
  rename_with(~ stringr::str_remove(.x,  pattern = ".y"),  ends_with(".y"))

Finalmente pegamos la vivienda.

endireh2021<-endireh2021 %>% 
  right_join(TVIV, by="ID_VIV") %>%  # ojo con el right ¿por qué?
  select(-ends_with(".x")) %>% # quita todas las variables que terminan en .x
  rename_with(~ stringr::str_remove(.x,  pattern = ".y"),  ends_with(".y"))

rm(endireh2021_ind, indice_tablas)

Skim

endireh2021 %>% 
  skimr::skim()

Data summary
Name	Piped data
Number of rows	432746
Number of columns	351
_______________________
Column type frequency:
character	50
logical	84
numeric	217
________________________
Group variables	None

Variable type: character

skim_variable	n_missing	complete_rate	min	max	n_unique
ID_PER	0	1.00	15	15	432746
N_REN_ESP	364206	0.16	2	2	16
P4AB_2	408186	0.06	2	2	70
P4BC_2	396008	0.08	2	2	13
P4_2	383766	0.11	6	6	545
P4_4	361532	0.16	3	198	23954
P4_5_AB	369721	0.15	6	6	365
P4_7_AB	365948	0.15	6	6	300
P4_9_1	425312	0.02	6	6	279
P4_9_2	429400	0.01	6	6	104
P4_9_3	427387	0.01	6	6	103
P4_9_4	427310	0.01	6	6	162
P4_9_5	430655	0.00	6	6	129
P4_9_6	417572	0.04	6	6	246
P4_9_7	431864	0.00	6	6	102
P4_13_1	418194	0.03	2	2	9
P4_13_2	387815	0.10	2	2	9
P4_13_3	420806	0.03	2	2	9
P4_13_4	359756	0.17	2	2	10
P4_13_5	429554	0.01	2	2	9
P4_13_6	431102	0.00	2	2	9
P4_13_7	427327	0.01	2	2	9
P8_6	394950	0.09	3	164	15143
P8_7	394950	0.09	2	2	15
ID_VIV	0	1.00	10	10	122646
N_REN	0	1.00	2	2	24
NOMBRE	0	1.00	1	1	1
PAREN	0	1.00	2	2	11
EDAD	0	1.00	2	2	100
P2_5	0	1.00	2	2	24
P2_6	0	1.00	2	2	22
NIV	16648	0.96	2	2	12
P2_14	271145	0.37	2	2	13
REN_MUJ_EL	322619	0.25	2	2	16
REN_INF_AD	310100	0.28	2	2	18
UPM	0	1.00	7	7	17763
VIV_SEL	0	1.00	2	2	24
CVE_ENT	0	1.00	2	2	32
NOM_ENT	0	1.00	6	31	32
CVE_MUN	0	1.00	3	3	269
NOM_MUN	0	1.00	4	49	1206
COD_RES	0	1.00	2	2	2
P1_2	0	1.00	2	2	12
P1_2_A	0	1.00	2	2	21
P1_3	0	1.00	2	2	76
P1_7	0	1.00	2	2	24
P1_9	411265	0.05	2	2	5
DOMINIO	0	1.00	1	1	3
EST_DIS	0	1.00	4	4	617
UPM_DIS	0	1.00	5	5	17763

Variable type: logical

skim_variable	n_missing	mean	count
P4_10_2_3	432746	NaN	:
P4_10_3_3	432746	NaN	:
P8_12_1_3	432746	NaN	:
P8_10_2_2	432743	1	TRU: 3
P8_10_2_3	432746	NaN	:
P8_12_2_1	432727	1	TRU: 19
P8_12_2_2	432746	NaN	:
P8_12_2_3	432746	NaN	:
P8_12_3_2	432746	NaN	:
P8_12_3_3	432746	NaN	:
P8_13_3_2	432746	NaN	:
P8_13_3_3	432746	NaN	:
P8_10_4_3	432746	NaN	:
P8_12_4_2	432746	NaN	:
P8_12_4_3	432746	NaN	:
P8_13_4_2	432746	NaN	:
P8_13_4_3	432746	NaN	:
P8_10_5_3	432745	1	TRU: 1
P8_12_5_2	432746	NaN	:
P8_12_5_3	432746	NaN	:
P8_13_7_2	432746	NaN	:
P8_13_7_3	432746	NaN	:
P8_10_8_2	432746	NaN	:
P8_10_8_3	432746	NaN	:
P8_12_8_2	432746	NaN	:
P8_12_8_3	432746	NaN	:
P8_13_8_2	432746	NaN	:
P8_13_8_3	432746	NaN	:
P8_10_9_1	432716	1	TRU: 30
P8_10_9_2	432746	NaN	:
P8_10_9_3	432746	NaN	:
P8_11_9	432736	1	TRU: 10
P8_12_9_1	432741	1	TRU: 5
P8_12_9_2	432746	NaN	:
P8_12_9_3	432746	NaN	:
P8_13_9_1	432684	1	TRU: 62
P8_13_9_2	432746	NaN	:
P8_13_9_3	432746	NaN	:
P8_10_10_2	432746	NaN	:
P8_10_10_3	432746	NaN	:
P8_12_10_2	432746	NaN	:
P8_12_10_3	432746	NaN	:
P8_13_10_2	432746	NaN	:
P8_13_10_3	432746	NaN	:
P8_10_11_3	432746	NaN	:
P8_12_11_2	432746	NaN	:
P8_12_11_3	432746	NaN	:
P8_13_11_2	432746	NaN	:
P8_13_11_3	432746	NaN	:
P8_10_12_3	432746	NaN	:
P8_12_12_2	432746	NaN	:
P8_12_12_3	432746	NaN	:
P8_10_13_2	432746	NaN	:
P8_10_13_3	432746	NaN	:
P8_12_13_2	432746	NaN	:
P8_12_13_3	432746	NaN	:
P8_13_13_2	432746	NaN	:
P8_13_13_3	432746	NaN	:
P8_10_14_2	432746	NaN	:
P8_10_14_3	432746	NaN	:
P8_12_14_2	432746	NaN	:
P8_12_14_3	432746	NaN	:
P8_13_14_2	432746	NaN	:
P8_13_14_3	432746	NaN	:
P8_10_15_3	432745	1	TRU: 1
P8_12_15_2	432746	NaN	:
P8_12_15_3	432746	NaN	:
P8_13_15_2	432746	NaN	:
P8_13_15_3	432746	NaN	:
P8_10_16_2	432745	1	TRU: 1
P8_10_16_3	432746	NaN	:
P8_12_16_2	432746	NaN	:
P8_12_16_3	432746	NaN	:
P8_13_16_2	432746	NaN	:
P8_13_16_3	432746	NaN	:
P8_13_17_3	432746	NaN	:
P8_13_18_2	432746	NaN	:
P8_13_18_3	432746	NaN	:
P8_10_19_2	432746	NaN	:
P8_10_19_3	432746	NaN	:
P8_12_19_2	432746	NaN	:
P8_12_19_3	432746	NaN	:
P8_13_19_2	432746	NaN	:
P8_13_19_3	432746	NaN	:

Variable type: numeric

skim_variable	n_missing	complete_rate	mean	sd	p0	p25	p50	p75	p100	hist
T_INSTRUM	432746	0.00	NaN	NA	NA	NA	NA	NA	NA
P3_1	322619	0.25	3.94	1.84	1	2.00	5.0	5.00	6	▆▁▂▇▃
P3_2	364172	0.16	1.04	0.19	1	1.00	1.0	1.00	2	▇▁▁▁▁
P3_3	430175	0.01	2.85	1.96	1	1.00	3.0	3.00	7	▇▆▁▁▃
P3_4	411195	0.05	1.93	0.25	1	2.00	2.0	2.00	2	▁▁▁▁▇
P3_5	431261	0.00	1.96	0.20	1	2.00	2.0	2.00	2	▁▁▁▁▇
P3_6	412744	0.05	2.69	1.11	1	1.00	3.0	3.00	4	▅▁▁▇▅
P3_7	423107	0.02	1.80	0.40	1	2.00	2.0	2.00	2	▂▁▁▁▇
P3_8	432746	0.00	NaN	NA	NA	NA	NA	NA	NA
P4AB_1	342055	0.21	2.52	1.19	1	1.00	3.0	3.00	4	▇▁▁▇▅
P4A_1	430337	0.01	4.56	2.81	1	2.00	5.0	7.00	9	▇▆▅▃▆
P4A_2	430337	0.01	4.99	2.64	1	3.00	5.0	8.00	8	▅▂▅▂▇
P4B_1	410595	0.05	1.99	0.93	1	1.00	2.0	3.00	3	▇▁▂▁▇
P4B_2	420179	0.03	3.87	2.14	1	2.00	3.0	5.00	9	▇▆▃▁▃
P4BC_1	396008	0.08	49.47	24.36	12	29.00	46.0	65.00	99	▇▇▇▃▃
P4C_1	418159	0.03	2.70	2.27	1	2.00	2.0	2.00	8	▇▁▁▁▂
P4BC_3	396008	0.08	3.07	1.69	1	3.00	3.0	3.00	8	▂▇▁▁▁
P4BC_4	396008	0.08	1.94	0.23	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4BC_5	430619	0.00	1.05	0.22	1	1.00	1.0	1.00	2	▇▁▁▁▁
P4_1	322619	0.25	1.56	0.50	1	1.00	2.0	2.00	2	▆▁▁▁▇
P4_2_1	388187	0.10	1.69	0.87	1	1.00	1.0	3.00	9	▇▃▁▁▁
P4_3	337052	0.22	1.83	2.05	1	1.00	1.0	2.00	9	▇▁▁▁▁
P4_4_CVE	361533	0.16	6257.34	2695.63	980	4111.00	6311.0	8349.00	9999	▅▃▅▅▇
P4_5_1_AB	385711	0.11	1.57	0.83	1	1.00	1.0	2.00	9	▇▂▁▁▁
P4_6_AB	351639	0.19	1.23	0.67	1	1.00	1.0	1.00	9	▇▁▁▁▁
P4_8_1	322619	0.25	1.93	0.25	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_8_2	322619	0.25	1.97	0.17	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_8_3	322619	0.25	1.95	0.22	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_8_4	322619	0.25	1.95	0.22	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_8_5	322619	0.25	1.98	0.14	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_8_6	322619	0.25	1.86	0.34	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_8_7	322619	0.25	1.99	0.09	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_10_2_1	429400	0.01	2.36	0.92	1	2.00	2.0	3.00	5	▂▇▂▂▁
P4_10_2_2	432610	0.00	3.56	0.73	2	3.00	4.0	4.00	5	▁▇▁▇▂
P4_10_3_1	427388	0.01	2.07	0.83	1	2.00	2.0	2.00	5	▂▇▁▁▁
P4_10_3_2	432480	0.00	3.67	0.67	2	3.00	4.0	4.00	5	▁▆▁▇▁
P4_11	322619	0.25	1.48	0.50	1	1.00	1.0	2.00	2	▇▁▁▁▇
P4_12_1	322619	0.25	1.87	0.34	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_12_2	322619	0.25	1.59	0.49	1	1.00	2.0	2.00	2	▆▁▁▁▇
P4_12_3	322619	0.25	1.89	0.31	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_12_4	322619	0.25	1.34	0.47	1	1.00	1.0	2.00	2	▇▁▁▁▅
P4_12_5	322619	0.25	1.97	0.17	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_12_6	322619	0.25	1.99	0.12	1	2.00	2.0	2.00	2	▁▁▁▁▇
P4_12_7	322619	0.25	1.95	0.22	1	2.00	2.0	2.00	2	▁▁▁▁▇
P8_1	322619	0.25	1.19	0.40	1	1.00	1.0	1.00	2	▇▁▁▁▂
P8_2	343986	0.21	1.24	0.43	1	1.00	1.0	1.00	2	▇▁▁▁▂
P8_3_1_1	365660	0.16	1.99	0.78	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_3_1_2	365660	0.16	2.05	0.74	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_3_2_1	365660	0.16	2.76	0.97	1	2.00	3.0	3.00	9	▅▇▁▁▁
P8_3_2_2	365660	0.16	2.76	0.96	1	2.00	3.0	3.00	9	▅▇▁▁▁
P8_3_2_3	365660	0.16	2.76	0.96	1	2.00	3.0	3.00	9	▅▇▁▁▁
P8_4	365660	0.16	1.18	0.38	1	1.00	1.0	1.00	2	▇▁▁▁▂
P8_5	377418	0.13	2.09	1.50	1	1.00	1.0	4.00	6	▇▁▃▁▁
P8_6_CVE	394950	0.09	5618.39	2993.52	980	3111.00	4231.0	9411.00	9999	▆▇▃▂▇
P8_8_1	394950	0.09	2.08	0.73	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_8_2	394950	0.09	2.07	0.74	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_8_3	394950	0.09	2.13	0.71	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_8_4	394950	0.09	2.14	0.67	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_8_5	394950	0.09	2.16	0.70	1	2.00	2.0	2.00	9	▇▂▁▁▁
P8_8_6	394950	0.09	2.68	0.79	1	2.00	3.0	3.00	9	▅▇▁▁▁
P8_8_7	394950	0.09	2.13	0.68	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_8_8	394950	0.09	2.09	0.69	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_8_9	394950	0.09	2.00	0.61	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_1	343986	0.21	1.99	0.47	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_2	343986	0.21	2.02	0.44	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_3	343986	0.21	1.98	0.48	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_4	343986	0.21	2.01	0.46	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_5	343986	0.21	1.98	0.48	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_6	343986	0.21	1.94	0.52	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_7	343986	0.21	1.97	0.49	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_8	343986	0.21	2.02	0.44	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_9	343986	0.21	2.02	0.44	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_10	343986	0.21	2.02	0.44	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_11	343986	0.21	2.00	0.46	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_12	343986	0.21	2.00	0.46	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_13	343986	0.21	2.01	0.45	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_14	343986	0.21	2.02	0.44	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_15	343986	0.21	1.99	0.48	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_16	343986	0.21	2.01	0.45	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_17	343986	0.21	1.98	0.48	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_18	343986	0.21	1.98	0.49	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_9_19	343986	0.21	2.02	0.45	1	2.00	2.0	2.00	9	▇▁▁▁▁
P8_10_1_1	429702	0.01	3.55	1.88	1	2.00	4.0	4.00	8	▅▁▇▁▁
P8_10_1_2	432009	0.00	4.49	1.53	1	4.00	4.0	5.00	8	▁▁▇▂▁
P8_10_1_3	432519	0.00	5.35	1.69	1	4.00	5.0	7.00	8	▁▁▇▂▅
P8_11_1	430239	0.01	3.09	1.15	1	2.00	4.0	4.00	9	▅▇▁▁▁
P8_12_1_1	431558	0.00	3.69	1.81	1	2.00	4.0	5.00	8	▃▁▇▁▁
P8_12_1_2	432470	0.00	4.69	1.50	2	4.00	4.0	5.00	8	▃▇▅▂▂
P8_10_2_1	432262	0.00	4.50	1.96	1	4.00	4.0	6.00	8	▂▁▇▂▂
P8_11_2	432339	0.00	3.23	1.08	1	2.00	4.0	4.00	9	▃▇▁▁▁
P8_10_3_1	428664	0.01	2.42	1.78	1	1.00	2.0	3.00	8	▇▂▂▁▁
P8_10_3_2	432090	0.00	3.98	1.60	1	3.00	4.0	5.00	8	▃▃▇▁▂
P8_10_3_3	432576	0.00	5.14	1.60	1	4.00	5.0	6.00	8	▁▂▇▁▃
P8_11_3	429930	0.01	3.45	1.14	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_3_1	431849	0.00	2.63	1.77	1	1.00	2.0	4.00	8	▇▂▃▁▁
P8_13_3_1	431849	0.00	1.30	0.96	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_4_1	430821	0.00	2.16	1.56	1	1.00	2.0	3.00	8	▇▂▁▁▁
P8_10_4_2	432457	0.00	3.57	1.50	2	2.00	3.0	4.00	8	▇▅▁▁▁
P8_11_4	431350	0.00	3.38	1.14	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_4_1	432274	0.00	2.28	1.47	1	1.00	2.0	3.00	8	▇▂▂▁▁
P8_13_4_1	432274	0.00	1.16	0.72	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_5_1	428761	0.01	3.26	2.12	1	1.00	3.0	4.00	8	▇▁▆▁▂
P8_10_5_2	432126	0.00	4.56	1.62	1	4.00	4.0	5.00	8	▂▂▇▂▂
P8_11_5	430061	0.01	3.37	1.15	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_5_1	431822	0.00	3.52	1.93	1	2.00	4.0	5.00	8	▆▁▇▁▁
P8_13_5_1	431822	0.00	1.33	0.88	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_13_5_2	432672	0.00	2.70	1.13	2	2.00	2.0	3.00	6	▇▂▁▂▁
P8_13_5_3	432736	0.00	3.50	0.71	3	3.00	3.0	4.00	5	▇▁▃▁▁
P8_10_6_1	425030	0.02	3.72	1.80	1	2.00	4.0	4.00	8	▃▁▇▁▁
P8_10_6_2	430989	0.00	4.60	1.47	1	4.00	4.0	5.00	8	▁▁▇▂▁
P8_10_6_3	432203	0.00	5.36	1.51	1	4.00	5.0	6.00	8	▁▁▇▃▃
P8_11_6	427212	0.01	3.12	1.22	1	2.00	4.0	4.00	9	▅▇▁▁▁
P8_12_6_1	430291	0.01	3.89	1.64	1	4.00	4.0	5.00	8	▂▁▇▁▁
P8_12_6_2	432196	0.00	4.76	1.37	2	4.00	5.0	5.00	8	▂▇▆▃▂
P8_12_6_3	432579	0.00	5.89	1.52	3	5.00	6.0	7.50	8	▇▇▇▂▇
P8_13_6_1	430291	0.01	1.25	0.84	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_13_6_2	432564	0.00	2.52	1.00	2	2.00	2.0	3.00	6	▇▁▁▁▁
P8_13_6_3	432716	0.00	3.63	0.81	3	3.00	3.0	4.00	6	▇▅▁▂▁
P8_10_7_1	427724	0.01	3.15	1.94	1	1.00	3.0	4.00	8	▇▂▇▁▁
P8_10_7_2	431646	0.00	4.31	1.62	1	3.00	4.0	5.00	8	▂▂▇▁▂
P8_10_7_3	432427	0.00	5.04	1.63	1	4.00	5.0	6.00	8	▁▂▇▂▃
P8_11_7	429292	0.01	3.20	1.23	1	2.00	4.0	4.00	9	▃▇▁▁▁
P8_12_7_1	431315	0.00	3.29	1.86	1	1.00	4.0	4.00	8	▇▁▇▁▁
P8_12_7_2	432423	0.00	4.37	1.50	2	4.00	4.0	5.00	8	▅▇▅▂▂
P8_12_7_3	432650	0.00	5.15	1.47	3	4.00	5.0	6.00	8	▇▆▃▂▂
P8_13_7_1	431315	0.00	1.22	0.82	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_8_1	432329	0.00	4.30	2.11	1	4.00	4.0	6.00	8	▃▁▇▂▃
P8_11_8	432450	0.00	3.48	0.99	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_8_1	432644	0.00	4.03	2.03	1	3.25	4.0	5.00	8	▃▁▇▁▂
P8_13_8_1	432644	0.00	1.44	0.92	1	1.00	1.0	2.00	5	▇▂▁▁▁
P8_10_10_1	432533	0.00	3.35	2.07	1	1.00	4.0	4.00	8	▇▁▇▁▂
P8_11_10	432595	0.00	3.34	1.18	1	2.50	4.0	4.00	9	▃▇▁▁▁
P8_12_10_1	432684	0.00	3.87	1.92	1	2.25	4.0	5.00	8	▃▁▇▁▁
P8_13_10_1	432684	0.00	1.58	1.44	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_11_1	430369	0.01	3.95	1.27	1	4.00	4.0	4.00	8	▁▁▇▁▁
P8_10_11_2	432421	0.00	4.88	1.81	1	4.00	4.0	6.00	8	▂▁▇▂▃
P8_11_11	430901	0.00	3.23	1.19	1	2.00	4.0	4.00	9	▃▇▁▁▁
P8_12_11_1	432018	0.00	3.97	1.21	1	4.00	4.0	4.00	8	▁▁▇▁▁
P8_13_11_1	432018	0.00	1.14	0.64	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_12_1	430323	0.01	4.87	2.02	1	4.00	5.0	6.00	8	▃▁▇▆▃
P8_10_12_2	432453	0.00	5.56	1.84	1	4.00	6.0	8.00	8	▁▁▇▅▆
P8_11_12	431043	0.00	3.30	1.13	1	2.00	4.0	4.00	9	▃▇▁▁▁
P8_12_12_1	432081	0.00	4.92	1.89	1	4.00	5.0	6.00	8	▂▁▇▅▂
P8_13_12_1	432081	0.00	1.83	0.74	1	1.00	2.0	2.00	5	▅▇▂▁▁
P8_13_12_2	432676	0.00	2.54	0.85	2	2.00	2.0	3.00	5	▇▂▁▁▁
P8_13_12_3	432731	0.00	3.13	0.35	3	3.00	3.0	3.00	4	▇▁▁▁▁
P8_10_13_1	431509	0.00	2.88	2.14	1	1.00	2.0	4.00	8	▇▁▃▁▂
P8_11_13	431961	0.00	3.58	1.17	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_13_1	432528	0.00	3.05	2.01	1	1.00	3.0	4.00	8	▇▁▆▁▁
P8_13_13_1	432528	0.00	1.45	1.10	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_14_1	432380	0.00	3.24	2.38	1	1.00	3.0	5.00	8	▇▁▃▁▂
P8_11_14	432519	0.00	3.48	1.17	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_14_1	432676	0.00	3.27	2.33	1	1.00	3.5	5.00	8	▇▁▆▁▂
P8_13_14_1	432676	0.00	2.16	1.64	1	1.00	1.0	3.00	6	▇▁▁▂▁
P8_10_15_1	429080	0.01	3.34	1.95	1	1.00	4.0	4.00	8	▇▁▇▁▂
P8_10_15_2	432243	0.00	4.56	1.46	1	4.00	4.0	5.00	8	▁▁▇▁▂
P8_11_15	430135	0.01	3.40	1.06	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_15_1	431816	0.00	3.53	1.73	1	2.00	4.0	4.00	8	▅▁▇▁▁
P8_13_15_1	431816	0.00	1.25	0.82	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_16_1	431620	0.00	3.90	2.22	1	1.00	4.0	6.00	8	▇▁▇▂▃
P8_11_16	431933	0.00	3.50	1.10	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_16_1	432488	0.00	4.13	1.92	1	3.00	4.0	5.00	8	▃▁▇▂▁
P8_13_16_1	432488	0.00	1.44	1.01	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_17_1	428939	0.01	2.64	1.76	1	1.00	2.0	4.00	8	▇▂▃▁▁
P8_10_17_2	431725	0.00	3.75	1.46	1	3.00	4.0	4.00	8	▃▂▇▁▁
P8_10_17_3	432418	0.00	4.65	1.58	1	4.00	4.0	5.00	8	▁▃▇▁▂
P8_11_17	429977	0.01	3.12	1.25	1	2.00	4.0	4.00	9	▅▇▁▁▁
P8_12_17_1	431512	0.00	2.58	1.62	1	1.00	2.0	4.00	8	▇▂▃▁▁
P8_12_17_2	432383	0.00	3.79	1.37	2	3.00	4.0	4.00	8	▆▇▂▁▁
P8_12_17_3	432634	0.00	4.89	1.54	3	4.00	5.0	6.00	8	▇▃▂▁▂
P8_13_17_1	431512	0.00	1.17	0.76	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_13_17_2	432713	0.00	2.76	1.32	2	2.00	2.0	3.00	6	▇▂▁▁▁
P8_10_18_1	428242	0.01	3.64	1.86	1	2.00	4.0	4.00	8	▅▁▇▁▁
P8_10_18_2	431756	0.00	4.47	1.63	1	4.00	4.0	5.00	8	▂▂▇▂▂
P8_10_18_3	432444	0.00	5.06	1.64	1	4.00	5.0	6.00	8	▁▂▇▂▃
P8_11_18	429510	0.01	3.15	1.21	1	2.00	4.0	4.00	9	▅▇▁▁▁
P8_12_18_1	431307	0.00	3.66	1.74	1	2.00	4.0	4.00	8	▅▁▇▁▁
P8_12_18_2	432405	0.00	4.60	1.56	2	4.00	4.0	5.00	8	▃▇▃▂▂
P8_12_18_3	432648	0.00	5.37	1.58	3	4.00	5.0	6.75	8	▇▅▃▂▃
P8_13_18_1	431307	0.00	1.21	0.75	1	1.00	1.0	1.00	6	▇▁▁▁▁
P8_10_19_1	431709	0.00	3.87	1.88	1	3.00	4.0	4.00	8	▃▁▇▁▂
P8_11_19	432027	0.00	3.36	1.14	1	3.00	4.0	4.00	9	▂▇▁▁▁
P8_12_19_1	432465	0.00	3.94	1.59	1	4.00	4.0	4.00	8	▂▁▇▁▁
P8_13_19_1	432465	0.00	1.31	0.95	1	1.00	1.0	1.00	6	▇▁▁▁▁
HOGAR	0	1.00	1.03	0.21	1	1.00	1.0	1.00	6	▇▁▁▁▁
SEXO	0	1.00	1.51	0.50	1	1.00	2.0	2.00	2	▇▁▁▁▇
GRA	16648	0.96	3.12	1.62	0	2.00	3.0	4.00	9	▂▇▂▂▁
P2_8	275478	0.36	1.30	0.50	1	1.00	1.0	2.00	9	▇▁▁▁▁
P2_9	16648	0.96	1.71	0.45	1	1.00	2.0	2.00	2	▃▁▁▁▇
P2_10	16648	0.96	2.53	1.14	1	1.00	3.0	3.00	8	▃▇▁▁▁
P2_11	16648	0.96	1.93	0.25	1	2.00	2.0	2.00	2	▁▁▁▁▇
P2_12	405162	0.06	1.08	0.26	1	1.00	1.0	1.00	2	▇▁▁▁▁
P2_13	81468	0.81	1.46	0.50	1	1.00	1.0	2.00	2	▇▁▁▁▇
P2_15	230938	0.47	2.17	1.50	1	1.00	1.0	4.00	6	▇▁▃▁▁
P2_16	81468	0.81	4.35	1.86	1	3.00	5.0	6.00	6	▆▁▁▇▇
COD_M15	263134	0.39	1.00	0.00	1	1.00	1.0	1.00	1	▁▁▇▁▁
CODIGO	263134	0.39	0.65	0.48	0	0.00	1.0	1.00	1	▅▁▁▁▇
FAC_MUJ	0	1.00	116.75	326.98	0	0.00	0.0	58.00	8836	▇▁▁▁▁
P1_1	0	1.00	2.40	0.55	1	2.00	2.0	3.00	3	▁▁▇▁▆
P1_4_1	0	1.00	1.41	0.49	1	1.00	1.0	2.00	2	▇▁▁▁▆
P1_4_2	0	1.00	1.08	0.27	1	1.00	1.0	1.00	2	▇▁▁▁▁
P1_4_3	0	1.00	1.58	0.49	1	1.00	2.0	2.00	2	▆▁▁▁▇
P1_4_4	0	1.00	1.64	0.48	1	1.00	2.0	2.00	2	▅▁▁▁▇
P1_4_5	0	1.00	1.06	0.24	1	1.00	1.0	1.00	2	▇▁▁▁▁
P1_4_6	0	1.00	1.10	0.30	1	1.00	1.0	1.00	2	▇▁▁▁▁
P1_4_7	0	1.00	1.24	0.42	1	1.00	1.0	1.00	2	▇▁▁▁▂
P1_4_8	0	1.00	1.51	0.50	1	1.00	2.0	2.00	2	▇▁▁▁▇
P1_4_9	0	1.00	1.36	0.48	1	1.00	1.0	2.00	2	▇▁▁▁▅
P1_5	0	1.00	1.43	1.10	1	1.00	1.0	1.00	6	▇▁▁▁▁
P1_6	0	1.00	1.36	0.76	1	1.00	1.0	2.00	5	▇▂▁▁▁
P1_8	0	1.00	1.05	0.22	1	1.00	1.0	1.00	2	▇▁▁▁▁
P1_10_1	411265	0.05	0.11	0.31	0	0.00	0.0	0.00	1	▇▁▁▁▁
P1_10_2	411265	0.05	0.84	0.36	0	1.00	1.0	1.00	1	▂▁▁▁▇
P1_10_3	411265	0.05	0.11	0.32	0	0.00	0.0	0.00	1	▇▁▁▁▁
P1_10_4	411265	0.05	0.13	0.33	0	0.00	0.0	0.00	1	▇▁▁▁▁
FAC_VIV	0	1.00	295.70	252.31	12	132.00	226.0	363.00	2826	▇▁▁▁▁
ESTRATO	0	1.00	2.14	0.82	1	2.00	2.0	3.00	4	▃▇▁▃▁