Posts by Sandro • 145 points
4 posts
-
2
votes1
answer578
viewsQ: Compare fields in two datasets
Considering two sets of read data from type files *.CSV with the Pandas. Each set has only one field CPF Favorecido,where there are millions of records. Each data set is equivalent to one month. I…
-
3
votes1
answer8603
viewsA: Grouping and aggregating data
I was able to find the solution using Pandas' "groupby". I created two distinct clusters, but by the same fields, one per value (sum of the column Value) and the other counting the number of NIS.…
-
3
votes1
answer8603
viewsQ: Grouping and aggregating data
I have the following file on CSV (12 Millions of records): UF Municipio Cod NIS Valor Data MA IMPERATRIZ 803 16361947271 45.00 01/01/2011 MA IMPERATRIZ 803 74629273937 15.00 01/01/2011 BA RUY…
-
6
votes1
answer80
viewsQ: Max of a numeric field returning NA
I’m starting to learn R and I came across a situation I don’t understand. I downloaded the data from ENEM 2014 (CSV file) and read using: dados_enem <- read.csv(file="MICRODADOS_ENEM_2014.csv",…