Identify duplicities and possible duplicities through Scores/ranking

Asked

Viewed 36 times

0

I have a database in mysql that has user information where I have to define what is duplicity or not.

Notes: The NOME_MAE and DATA_NASC fields are not mandatory. Examples of records: inserir a descrição da imagem aqui

Just doing the soundex of the person’s name and the mother’s does not help, because as in the example the word IGNACIO(I252) gives a soundex other than INACIO(I520). It intended to make a system of colors to identify the duplicity and the possible duplicities. I will add the INICIAIS_NOME variable to help with the soundex problem. But I have no idea how to mount this SCORE. Any help?? It can be in PHP or MYSQL. Thank you!

  • 1

    Take a look at this answer: Smart Query with Mysql

  • I have a function in Oracle that when comparing these two names it informs me that they are 92% similar. Something like this would help?

  • Hint of a way for you to try, I would make several queries, give Union, transform into subquery (put this after from) and give a sum, maybe use a CASE WHEN... https://dev.mysql.com/docrefman/5.7/en/case.html

  • So "@Sorack", about this link you informed I’m already doing this way.... the name problem is 90% solved, since I use a soundexBR for the names. The problem is when I have duplicities with different ages or no mother’s name in some...understood.... notice the example image.

  • @Reginaldorigo, I’ll do a search and see if you have the same for Mysql

No answers

Browser other questions tagged

You are not signed in. Login or sign up in order to post.