1
Dataframe:
dataNomeAlunos
NomeAluno Nota Materia
Marcos Souza Magalhaes 10 Port
Junior Moraes 5 Mat
Mirela Chavier 3 Port
dataNomeMatch
NomeAluno Nota Materia
Marcos Sousa Magalhaes 10 Portugues
Jr Moraes 5 Matematica
I have a question about how the function works levenshteinSim
, bundle RecordLinkage
.
I have two dataframes, dataNomeAlunos
and dataNomeMatch
, how it reacts when the columns have different amount of information?
And how could I use his feedback to identify the "match" ?
Dice.
dataNomeAlunos <- read.table(text = "
NomeAluno Nota Materia
'Marcos Souza Magalhaes' 10 Port
'Junior Moraes' 5 Mat
'Mirela Chavier' 3 Port
", header = TRUE, stringsAsFactors = FALSE)
dataNomeMatch <- read.table(text = "
NomeAluno Nota Materia
'Marcos Sousa Magalhaes' 10 Portugues
'Jr Moraes' 5 Matematica
", header = TRUE, stringsAsFactors = FALSE)