Posts by lmonferrari • 3,550 points

179 posts

0
votes

1
answer

44
views

A: PANDAS - PYTHON - FIND DIFFERENT VALUES

You can use pandas' isin. Importing pandas: import pandas as pd Creating the dataframes: Novo_Mailing_df = pd.read_csv('../DADOS/Novo_Mailing.csv', sep = ';', names=['Coluna1'])…

python pandas join
answered 4 years ago lmonferrari 3,550
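A minimal sketch of the isin approach from the answer above. The two small data frames are made-up stand-ins for the CSV files, and the name of the second frame is an assumption, since the original snippet is cut off before it appears.

```python
import pandas as pd

# Hypothetical stand-ins for the two mailing files (only 'Coluna1' comes from the answer).
novo_mailing_df = pd.DataFrame({"Coluna1": ["a", "b", "c", "d"]})
antigo_mailing_df = pd.DataFrame({"Coluna1": ["b", "d"]})  # assumed second frame

# True where the value of the new mailing also exists in the old one.
mask = novo_mailing_df["Coluna1"].isin(antigo_mailing_df["Coluna1"])

# Negate the mask to keep only the values that are different.
diferentes = novo_mailing_df[~mask]
print(diferentes)
```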
0
votes

1
answer

25
views

A: Python - append in the same Dataframe

You can use pandas' append; in this case a new data frame is created and the results are added to it: import pandas as pd import requests header = { "User-Agent": "Mozilla/5.0 (X11; Linux…

python pandas
answered 4 years ago lmonferrari 3,550
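DataFrame.append was removed in pandas 2.0, so on current versions the same "accumulate results in one data frame" idea is usually written with pd.concat. A sketch under that assumption, with made-up rows in place of the data the answer fetches with requests:

```python
import pandas as pd

# Hypothetical per-iteration results; the original answer downloads them with requests.
pages = [
    [{"id": 1, "valor": 10}],
    [{"id": 2, "valor": 20}, {"id": 3, "valor": 30}],
]

frames = []
for rows in pages:
    # Collect one small data frame per iteration (this is list.append, not DataFrame.append).
    frames.append(pd.DataFrame(rows))

# Concatenate once at the end and rebuild a clean 0..n-1 index.
resultado = pd.concat(frames, ignore_index=True)
print(resultado)
```

Concatenating once at the end also avoids copying the growing data frame on every iteration.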
0
votes

1
answer

31
views

A: Difficulty Merging Sequential Columns in a Dataframe with Pandas

You can use pandas' concat, passing the data frames and the column axis: parte3 = pd.concat([parte1, parte2], axis=1)

python pandas csv
answered 4 years ago lmonferrari 3,550
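The concat call from the answer above, as a small runnable sketch. The contents of parte1 and parte2 are invented for illustration.

```python
import pandas as pd

# Hypothetical frames with the same number of rows.
parte1 = pd.DataFrame({"a": [1, 2, 3]})
parte2 = pd.DataFrame({"b": [4, 5, 6]})

# axis=1 places the frames side by side, aligning rows on the index.
parte3 = pd.concat([parte1, parte2], axis=1)
print(parte3)
```

Because rows are aligned on the index, frames with different indexes would produce NaN in the non-matching rows.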
4
votes

3
answers

54
views

A: Replacing NA values of a column by the value of the top row of the same column of a dataframe

An alternative would be to use fill from the tidyr package: library(tidyr) DADOS <- data.frame( a = c(1, 2, NA, 3, 4), b = c(5, 6, 7, NA, NA) ) DADOS %>% fill(a,b) Output: a b 1 1 5 2 2 6 3 2 7 4 3 7 5 4…

r
answered 4 years, 1 month ago lmonferrari 3,550
0
votes

1
answer

31
views

A: Create dataframe pandas 1 key and some non-standard values in the dictionary

import pandas as pd dicionario = {0:[['tela1'],['tela2'],['tela3']], 1:[['tela2']], 2:[['tela5'],['tela7']], 4:[['tela1'],['tela3']]} df = pd.DataFrame.from_dict(dicionario, orient='index') df =…

python pandas
answered 4 years, 2 months ago lmonferrari 3,550
3
votes

2
answers

77
views

A: In a Dataframe, modify data from one column conditioned to the value of another column

In addition to @Augusto Vasques's suggestion, you can use loc as you previously tried: df.loc[df['Side'] == 'BUY', 'Amount'] = -df['Amount'] loc + isin: df.loc[df['Side'].isin(['BUY']), 'Amount'] =…

python pandas conversion
answered 4 years, 2 months ago lmonferrari 3,550
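A runnable sketch of the loc approach from the answer above, with made-up rows. Only the column names Side and Amount and the value 'BUY' come from the original.

```python
import pandas as pd

# Hypothetical trades.
df = pd.DataFrame({"Side": ["BUY", "SELL", "BUY"], "Amount": [10.0, 5.0, 2.5]})

# Boolean mask selecting the rows to change.
mask = df["Side"] == "BUY"

# Flip the sign of Amount only on the selected rows.
df.loc[mask, "Amount"] = -df.loc[mask, "Amount"]
print(df)
```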
2
votes

1
answer

66
views

A: How to save the generated graphics files (in png) within a loop

Add the png line with the location where you want to save the file (you need write permission, so change the path to your image folder as in the example below). At the end of your chart generation add dev.off()…

r loop
answered 4 years, 4 months ago lmonferrari 3,550
2
votes

2
answers

43
views

A: move data left in pandas

Here is a possible solution: slice, then use shift to move the columns. import pandas as pd import numpy as np tabelas =…

python pandas
answered 4 years, 4 months ago lmonferrari 3,550
1
votes

1
answer

41
views

A: Sum pandas columns by row and selecting comparative by Qgrid row

Importing the libs import pandas as pd import seaborn as srn import statistics as sts Loading the data dataset = pd.read_excel('/content/drive/MyDrive/Data science /BRA 2020.xlsx') Excluding the…

python pandas
answered 4 years, 4 months ago lmonferrari 3,550
0
votes

2
answers

185
views

A: Beautifulsoup: Catch text inside table

As stated in the other answer, the page loads a template that is filled in afterwards, so requests cannot get the correct values. Using requests_html. Importing the lib: from requests_html import HTMLSession Creating the…

html python table web-scraping beautifulsoup
answered 4 years, 4 months ago lmonferrari 3,550
0
votes

1
answer

33
views

A: Separate a Dataframe

Maybe pandas' groupby and pct_change can help you: df['pct_change_new_confirmed'] = df.groupby('state')['new_confirmed'].pct_change().fillna(0) df['pct_change_new_deaths'] =…

python pandas
answered 4 years, 4 months ago lmonferrari 3,550
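A small sketch of the groupby plus pct_change line from the answer above. The states and counts are invented; only the column names come from the original.

```python
import pandas as pd

# Hypothetical daily counts for two states.
df = pd.DataFrame({
    "state": ["SP", "SP", "RJ", "RJ"],
    "new_confirmed": [100, 150, 50, 25],
})

# Percentage change is computed within each state; the first row of
# each group has no previous value, so its NaN is replaced with 0.
df["pct_change_new_confirmed"] = (
    df.groupby("state")["new_confirmed"].pct_change().fillna(0)
)
print(df)
```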
4
votes

4
answers

1234
views

A: Python - Calculate transposed matrix

Source matrix: M =[[1,2],[3,4],[5,6]] Printing: for j in M: print(j) Output: [1, 2] [3, 4] [5, 6] Transposed matrix. Creating the matrix: M_t = list(map(list, zip(*M))) Printing: for j in M_t: print(j) Output: [1, 3,…

python matrix map
answered 4 years, 4 months ago lmonferrari 3,550
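The transposition from the answer above, written out as a runnable snippet. zip(*M) unpacks the rows as separate arguments, so each tuple it yields is one column of M.

```python
# Source matrix from the answer: 3 rows, 2 columns.
M = [[1, 2], [3, 4], [5, 6]]

# zip(*M) yields the columns as tuples; map(list, ...) turns them back into lists.
M_t = list(map(list, zip(*M)))

for row in M_t:
    print(row)
```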
1
votes

1
answer

51
views

A: Python Dataframe dynamically

I believe you can create a 'temporary' data frame, make the predictions and save in an xlsx for example: for cod in dados['Codproduto'].unique(): df_temp =…

python pandas for
answered 4 years, 4 months ago lmonferrari 3,550
2
votes

2
answers

63
views

A: How to add a new column with the group average in pandas?

Importing the pandas import pandas as pd Reading the dataset dataset = pd.read_excel('./BRA.xlsx') Removing unnecessary columns dataset.drop(columns=['League','Country','Time','Date'], inplace=True)…

pandas mean
answered 4 years, 4 months ago lmonferrari 3,550
3
votes

2
answers

35
views

A: How to export [[ ]] from a list in separate excel files using R?

... library(xlsx) producao_per_discente<-split(producao, producao$discente) lapply(seq_along(producao_per_discente), function(i) write.xlsx(producao_per_discente[[i]], file =…

r export split
answered 4 years, 4 months ago lmonferrari 3,550
0
votes

1
answer

42
views

A: How To Order In R

You can use sort: dados = c(8,6,5,4,1,3,7) dados = sort(dados, decreasing = F) dados Output: 1 3 4 5 6 7 8

r
answered 4 years, 4 months ago lmonferrari 3,550
0
votes

1
answer

55
views

A: Overwrite all data from a column of a data frame in R

Using this data: ae <- c(1,2,3,4,5,6,7,8,9,10) be <- c(10,9,8,7,6,5,4,3,2,1) pnadc1 <- data.frame(ae,be) You can reassign the values this way: pnadc1$ae <- 200 Output: ae be 1 200 10 2 200 9 3…

rstudio
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

1
answer

37
views

A: Join two columns with different dates

... df_sp1 = DataReader('^GSPC', data_source='yahoo', start='2020-1-1', end="2020-02-04") df_sp2 = DataReader('^GSPC', data_source='yahoo', start='2021-1-1') You can use concat: novo_df =…

python python-3.x pandas
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

2
answers

97
views

A: How do I filter a Dataframe row by knowing a String value from one of its columns?

Creating Data Frame Test import pandas as pd codigos = ['cod1','mxrf11','cod2','mxrf11','cod3','mxrf11'] valores = ['teste1','teste2','teste3','teste4','teste5','teste6'] df =…

python python-3.x pandas
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

1
answer

88
views

A: transform data from a Dataframe column into a single string

You can use to_string: df['Coluna'].to_string() import pandas as pd palavras = ['ola','como','vai','você?'] dados = pd.DataFrame({'Texto': palavras}) dados Data: Texto 0 olá 1 como 2 vai 3 você?…

python pandas nltk
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

2
answers

62
views

A: Perform a previous values calculation in column on R

Maybe this will help you, using dplyr. Test data frame: dados <- data.frame(coluna_1 = c(558.8, 584.3, 603.3)) The logic: library(dplyr) dados <- dados %>% mutate(coluna_2 = case_when(…

python r excel
answered 4 years, 5 months ago lmonferrari 3,550
3
votes

1
answer

52
views

A: How to split columns/data with a specific limit?

You can set a value for chunksize: import pandas as pd # chunk size tamanho = 5000 for fatia in pd.read_csv('./arquivo.csv', chunksize = tamanho): # your code here…

python pandas machine-learning
answered 4 years, 5 months ago lmonferrari 3,550
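A self-contained sketch of the chunksize idea from the answer above. An in-memory CSV stands in for './arquivo.csv', and the chunk size is reduced to 5 so the behavior is easy to see.

```python
import io

import pandas as pd

# Hypothetical 12-row CSV held in memory instead of a file on disk.
csv_text = "x\n" + "\n".join(str(i) for i in range(12))

tamanho = 5  # chunk size (the answer uses 5000)
tamanhos = []

# With chunksize set, read_csv returns an iterator of data frames.
for fatia in pd.read_csv(io.StringIO(csv_text), chunksize=tamanho):
    tamanhos.append(len(fatia))  # replace with the real per-chunk work

print(tamanhos)
```

The last chunk holds whatever rows remain, so it can be shorter than the others.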
2
votes

2
answers

53
views

A: How to change abbreviated values in a DF using Pandas in Python

In addition to Lucas's answer (which I actually prefer), you can keep the basis of your code. Creating the test data frame: import pandas as pd dados = ["35,57B", "6,85T"] df =…

python pandas date
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

1
answer

68
views

A: Pencil drawing effect with OpenCV and Python

import cv2 # open the image in grayscale img_gray = cv2.imread('wonder-woman.png', cv2.IMREAD_GRAYSCALE) # compute the inverse (255 is white, 0 is black) and apply the blur img_gray_inv = 255…

python opencv image-processing
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

2
answers

81
views

A: format data from a column in a data.frame in R

An example using the dplyr valores_1 <- c('24','25','34','234','0045', '1234') dados <- data.frame(Coluna1 = valores_1, stringsAsFactors = FALSE) library(dplyr) dados %>% mutate( Coluna1 =…

r
answered 4 years, 5 months ago lmonferrari 3,550
2
votes

2
answers

55
views

A: How to calculate the average for groups and identify the maximum value?

One way to return the values in an "ordered" way is to use reset_index: dfseason = df.groupby(by='Month', sort=True)['Billed'].sum().nlargest(1).reset_index() dfseason Output: Month Billed 0 May 918…

python pandas
answered 4 years, 5 months ago lmonferrari 3,550
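A sketch of the groupby, sum, nlargest and reset_index chain from the answer above. The rows are invented and chosen so that May sums to 918, matching the output shown in the snippet.

```python
import pandas as pd

# Hypothetical billing rows.
df = pd.DataFrame({
    "Month": ["Apr", "May", "May", "Jun"],
    "Billed": [300, 500, 418, 250],
})

# Sum per month, keep the single largest total, then turn the
# 'Month' index back into a regular column.
dfseason = (
    df.groupby(by="Month", sort=True)["Billed"].sum().nlargest(1).reset_index()
)
print(dfseason)
```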
2
votes

1
answer

42
views

A: Python: I need to get the y coordinate given to x coordinate

import matplotlib.pyplot as plt from numpy import polyfit Defining the known X and Y: X = [0, 5] Y = [2, 4] Calculating the coefficients: m, b = polyfit(X, Y, deg=1) New x for the calculation: x = 2.5 Line…

python plot
answered 4 years, 5 months ago lmonferrari 3,550
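The calculation from the answer above without the plotting part. The last step, evaluating y = m * x + b, is an assumption about where the truncated snippet is heading.

```python
from numpy import polyfit

# Known points from the answer.
X = [0, 5]
Y = [2, 4]

# Degree-1 fit returns the slope m and the intercept b of the line.
m, b = polyfit(X, Y, deg=1)

# Evaluate the line at the new x.
x = 2.5
y = m * x + b
print(y)
```

With these two points the slope is 0.4 and the intercept is 2, so y is approximately 3.0 (up to floating-point rounding).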
1
votes

3
answers

64
views

A: optimize Camelot large pdf files

Maybe this gives some optimization: import camelot, PyPDF2, tqdm import pandas as pd from tkinter import Tk, filedialog as dlg Tk().withdraw() file_path = dlg.askopenfilename() last_page =…

python pandas
answered 4 years, 5 months ago lmonferrari 3,550
3
votes

2
answers

260
views

A: In Python, how do you remove specific characters from all the records of just one particular column?

You can use apply and slice the string: raw_data['nome_arquivo'] = raw_data['nome_arquivo'].apply(lambda x: x[:-4]) You can also use replace: raw_data['nome_arquivo'] =…

python string pandas ipython-notebook
answered 4 years, 5 months ago lmonferrari 3,550
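A sketch comparing the two options mentioned in the answer above. The file names and the '.csv' suffix are assumptions, and because the replace call is cut off in the snippet, the str.replace form below is one possible way to write it, not necessarily the author's.

```python
import pandas as pd

# Hypothetical file names ending in a 4-character extension.
raw_data = pd.DataFrame({"nome_arquivo": ["a.csv", "relatorio.csv"]})

# Option 1: drop the last four characters of every value.
sliced = raw_data["nome_arquivo"].apply(lambda x: x[:-4])

# Option 2 (assumed form): remove the literal substring '.csv'.
replaced = raw_data["nome_arquivo"].str.replace(".csv", "", regex=False)

print(sliced.tolist())
print(replaced.tolist())
```

Slicing removes four characters whatever they are, while the replace form only touches values that actually contain '.csv'.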
0
votes

1
answer

33
views

A: In Python and Jupyter Notebook, how to display a full record on screen?

According to the documentation you can set this using max_colwidth. pd.set_option("max_colwidth", 40) 0 1 2 3 0 foo bar bim uncomfortably long string 1 horse cow banana apple Setting a lower value…

python string pandas ipython-notebook
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

1
answer

41
views

A: Pivoting in Pandas

See if this way works for you df.pivot_table(index = 'ID_PACIENTE', columns = 'DE_ANALITO', values = 'DE_RESULTADO', aggfunc = ''.join).reset_index().rename_axis(None, axis = 1) or df.pivot(index =…

python pandas
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

2
answers

64
views

A: Select columns from a base without having to read the whole file

dados19 <- read.csv('./SUP_ALUNO_2019.CSV', sep = '|', dec = '.', colClasses = c('NULL','NULL','NULL','NULL','integer', 'NULL','NULL','NULL','NULL','NULL', 'NULL','NULL','NULL','NULL','integer',…

r
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

3
answers

125
views

A: How to add one column of data based on another in excel through Pandas?

Importing the libs import pandas as pd import numpy as np Creating the test df conteudo = ['1 X 40 CONTAINERS 40 BAGS OF FLUTRIAFOL TECNICO SINON FLUTRIAFOL 97% TECH', '1 X 20 CONTAINERS 20 BAGS OF…

python excel pandas
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

1
answer

75
views

A: NLP Text sorting using Python

You can work with a library that does fuzzy string matching. Fuzzy string matching is used to find similarities between strings even when there are typing errors. Fuzzywuzzy works with Levenshtein distance to…

python machine-learning
answered 4 years, 5 months ago lmonferrari 3,550
3
votes

1
answer

71
views

A: ValueError: 1 columns passed, passed data had 12 columns

The first error in your case occurs here, where you pass a list of lists: conteudo2 = [['Pontos Ganhos','Vitórias','Empates','Derrotas','Saldo de Gols','Gols Pró','Gols Contra','Chance de…

python pandas
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

1
answer

85
views

A: Python/Pandas: Treatment of TXT

A possible solution using pandas: import pandas as pd # load the data and assign names to the columns colunas = ['Data','id_m','id_c','Data inicial','Data…

python pandas numpy
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

1
answer

133
views

A: Abstract class 'ExcelWriter' with abstract methods instantiated pylint(abstract-class-instantiated)

As the documentation says, when you want to save more than one sheet in the same file you need to declare the ExcelWriter object: with pd.ExcelWriter('teste.xlsx') as writer: df1.to_excel(writer,…

python excel pandas
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

2
answers

105
views

A: Join lines from a Python column

An alternative using pandas groupby import pandas as pd import numpy as np Creating the test data frame dados = pd.DataFrame({'Dados':np.random.randint(1,100, 43184)}) Calculating the average with…

python
answered 4 years, 5 months ago lmonferrari 3,550
0
votes

1
answer

26
views

A: Return the most recent files in a folder

from pathlib import Path import pandas as pd directory = Path('./') files = list(directory.rglob('*.*')) raw_data = [[item.name,item.stat().st_mtime] for item in files] df = pd.DataFrame(raw_data,…

pandas path
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

1
answer

62
views

A: Removing Symbols in Python dataframe columns

Test data frame: import pandas as pd import re titulo = ['[Cobra Kai]', '[Bridgerton]', '[Vikings]'] genero = ['[\nAction, Comedy, Drama]', '[\nDrama, Romance]','[\nAction,Adventure, Drama]'] ano =…

replace
answered 4 years, 5 months ago lmonferrari 3,550
1
votes

1
answer

433
views

A: Python Pandas: Dataframe convert Timestamp column to Datetime

import pandas as pd data = [1610323200000,1610409600000,1610409600000,1610496000000,1610582400000] volume = [38150.02,35410.37,34049.15,37371.38,39145.21] abertura =…

python pandas datetime timestamp
answered 4 years, 5 months ago lmonferrari 3,550
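The values in the answer above look like Unix timestamps in milliseconds. A sketch of the conversion under that assumption, using pd.to_datetime with unit='ms'; the column name is hypothetical because the snippet is cut off before the data frame is built.

```python
import pandas as pd

# Timestamps copied from the answer; assumed to be milliseconds since the Unix epoch.
data = [1610323200000, 1610409600000, 1610409600000, 1610496000000, 1610582400000]

df = pd.DataFrame({"data": data})  # hypothetical column name

# unit='ms' tells pandas how to interpret the integers.
df["data"] = pd.to_datetime(df["data"], unit="ms")
print(df)
```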
2
votes

1
answer

32
views

A: Creating a new column using for

Data: df_vot = pd.read_csv('./Dados aula04/votacao_partido_munzona_2020_BRASIL.csv', sep = ';', encoding = 'latin1') centro = ['AVANTE', 'MDB', 'PROS', 'PSDB', 'SOLIDARIEDADE'] direita = ['DC',…

python pandas
answered 4 years, 6 months ago lmonferrari 3,550
2
votes

1
answer

31
views

A: Extract only uppercase words with R

You can check for one or more uppercase letters between word boundaries: str_extract_all(teste ,'\\b[A-Z]+\\b') or str_extract_all(teste, "\\b[:upper:]+\\b")…

r string
answered 4 years, 6 months ago lmonferrari 3,550
2
votes

1
answer

88
views

A: Error printing HTML with Beautifulsoup

import requests from bs4 import BeautifulSoup as bs headers = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko)'} url =…

html python python-3.x python-requests beautifulsoup
answered 4 years, 6 months ago lmonferrari 3,550
1
votes

1
answer

30
views

A: How to edit values in a column?

Maybe this will help you: library(lubridate) df$dia <- day(df$date) df$ano_mes <- paste0(year(df$date),'-',month(df$date)) Using the lubridate package we can extract the day, month and year.…

r rstudio
answered 4 years, 6 months ago lmonferrari 3,550
5
votes

1
answer

90
views

A: How to use Beautifulsoup’s "find" to find a script tag with a specific type?

import requests from bs4 import BeautifulSoup as bs def get_cod_produto(url): response = requests.get(url) data = response.text soup = bs(data, 'html.parser') return soup.find('script',…

html python beautifulsoup
answered 4 years, 6 months ago lmonferrari 3,550
4
votes

1
answer

62
views

A: Make a graph in R (ggplot) similar to Excel bar charts

library(dplyr) library(tidyr) library(ggplot2) df <- read.csv2('./Tabela_areas_referencias_porcent_2.csv') df_pivoted <- pivot_longer( data = df, cols = c("Vegetação_Nativa",…

r ggplot2
answered 4 years, 6 months ago lmonferrari 3,550
5
votes

2
answers

70
views

A: Error plotting with ggplot

An example creating the sequence of dates with n equal to zero library(ggplot2) library(dplyr) df <-data.frame( ano = c(2007, 2008, 2017, 2018), n = c(1, 2, 2, 1) ) anos <- data.frame(ano =…

r ggplot2
answered 4 years, 6 months ago lmonferrari 3,550
3
votes

1
answer

53
views

A: How to delete null lines in a Dataframe?

You can return those that have no missing values this way: df[~df.isnull()] The ~ operator negates the mask, i.e. it turns "is null" into "not null".

python pandas
answered 4 years, 6 months ago lmonferrari 3,550
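Indexing a whole data frame with a boolean data frame keeps its shape and leaves NaN in the masked cells, so it does not remove any rows. If the goal is to delete the rows that contain missing values, a sketch along these lines may be closer to what the question asks. The data is invented, and dropna is swapped in here rather than taken from the answer.

```python
import numpy as np
import pandas as pd

# Hypothetical frame with missing values in rows 1 and 2.
df = pd.DataFrame({"a": [1.0, np.nan, 3.0], "b": ["x", "y", None]})

# Drop every row that has at least one missing value.
sem_nulos = df.dropna()

# Equivalent row-wise mask: keep rows where all cells are not null.
mesma_coisa = df[df.notnull().all(axis=1)]

print(sem_nulos)
```

dropna also accepts how='all' to drop only rows that are entirely empty, and subset=[...] to check specific columns.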
1
votes

1
answer

69
views

A: Compare the information of two data.frames (tables) to create groups and a third column. in R

You can use between to compare the data frame's row indexes, in this case: library(dplyr) rows <- rownames(CBO2002) CBO2002 <- CBO2002 %>% mutate(grupo = case_when( between(rows,1,28) ~…

r
answered 4 years, 6 months ago lmonferrari 3,550