I am trying to scrape a website and would like to capture the values that follow the pattern below:
Advogado: XXXXX Número do Processo: XXXXXX OutroCampo: XXXXX
These pieces of information are usually separated by a space, so what should be captured is, for example, Lawyer: Bill Gates
(followed by a space/tab)
Pattern:
FIELD_NAME:(optional space)Value to be captured
(trailing space)
I started with this regex, but it only matches the field name at the beginning and not the value "in between":
regex: \w+:\s{1}
The field name (Lawyer) may vary, which is why I used \w+
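To make the problem concrete, here is a minimal Python sketch of what that regex matches. The sample values (`12345`, `abc`) are made up for illustration; only the field labels come from the pattern above.

```python
import re

# Sample line following the pattern in the question; the values are invented.
text = "Advogado: Bill Gates Número do Processo: 12345 OutroCampo: abc"

# The regex from the question: a run of word characters, a colon, one whitespace.
pattern = re.compile(r"\w+:\s{1}")

# Only the labels are matched, and for a multi-word label only its last word,
# because \w+ cannot cross the spaces in "Número do Processo".
matches = pattern.findall(text)
print(matches)  # ['Advogado: ', 'Processo: ', 'OutroCampo: ']
```

Nothing in the expression consumes the text after the colon, so the value itself is never part of the match.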
– Thiago
The "process number" may vary as well
– Thiago
If the data can vary, I suggest you edit the question and add a few more examples. But I can already say that if both the field's value (first name and surnames) and the name of the next field ("Process number") are made up of words separated by spaces, it is difficult to know where one ends and the other begins
– hkotsubo
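One possible way around that ambiguity, sketched in Python, works only under the assumption that the set of field names is known in advance. The `FIELD_NAMES` list and the `extract_fields` helper are hypothetical, and the sample values are made up; if the labels really are arbitrary, this approach does not apply.

```python
import re

# ASSUMPTION: the labels are known beforehand (hypothetical list).
FIELD_NAMES = ["Advogado", "Número do Processo", "OutroCampo"]

def extract_fields(text, field_names=FIELD_NAMES):
    """Return {label: value}, treating any known label as a delimiter."""
    # Longest labels first so a shorter label cannot shadow a longer one.
    labels = "|".join(
        re.escape(name) for name in sorted(field_names, key=len, reverse=True)
    )
    # A label, a colon, optional whitespace, then lazily everything up to
    # the next known label (or the end of the text).
    pattern = re.compile(
        rf"({labels}):\s*(.*?)\s*(?=(?:{labels}):|$)", re.DOTALL
    )
    return {m.group(1): m.group(2) for m in pattern.finditer(text)}

sample = "Advogado: Bill Gates Número do Processo: 12345 OutroCampo: abc"
fields = extract_fields(sample)
print(fields)
```

Because the known labels act as delimiters, values and labels containing spaces no longer run into each other.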