Using levenshteinSim in R with two data frames

Data frames:

dataNomeAlunos

    NomeAluno                 Nota Materia
    Marcos Souza Magalhaes     10  Port
    Junior Moraes               5  Mat
    Mirela Chavier              3  Port

dataNomeMatch

   NomeAluno                 Nota Materia
   Marcos Sousa Magalhaes     10  Portugues
   Jr Moraes                   5  Matematica

I have a question about how the levenshteinSim function from the RecordLinkage package works.
I have two data frames, dataNomeAlunos and dataNomeMatch. How does the function behave when the columns being compared hold different amounts of data (dataNomeAlunos has three rows, dataNomeMatch has two)?

And how can I use its return value to identify the "match"?

Data:

dataNomeAlunos <- read.table(text = "
NomeAluno                  Nota  Materia
'Marcos Souza Magalhaes'     10  Port
'Junior Moraes'               5  Mat
'Mirela Chavier'              3  Port
", header = TRUE, stringsAsFactors = FALSE)

dataNomeMatch <- read.table(text = "
NomeAluno                  Nota  Materia
'Marcos Sousa Magalhaes'     10  Portugues
'Jr Moraes'                   5  Matematica
", header = TRUE, stringsAsFactors = FALSE)
No answers
