Posts by lmonferrari • 3,550 points

179 posts

1
votes

1
answer

122
views

A: Add a new column in a Dataframe after comparing data with another Dataframe?

What you want to do (at least it is what it looks like) does not need to join two data frames, because the data frame quotation itself already brings the information you want. df_cotacao['Moeda'] =…

python pandas
answered 4 years, 6 months ago lmonferrari 3,550
2
votes

2
answers

35
views

A: How to apply a ribbon to a dataframe based on the last characters of each label?

A solution similar to Lucas' would be to use the apply lambda df[df['A'].apply(lambda x: x[-3:] == 'BRL')] You could also use split df[df['A'].str.split('-').apply(lambda x: x[1] == 'BRL')] In both…

python pandas rowfilter
answered 4 years, 6 months ago lmonferrari 3,550
1
votes

1
answer

82
views

A: How to group data from another grouping?

A possible solution df['Variação_Salario'] = df.sort_values('Nome').groupby(['Nome'])['Salario'].pct_change().fillna(0).add(1) df['Porcentagem acumulada'] =…

python pandas numpy
answered 4 years, 6 months ago lmonferrari 3,550
1
votes

2
answers

33
views

A: Check whether a variable created through exec() exists

You can use dir() Python and iterate over it to check if it exists for e in dir(): if e == 'host5': print('ok') Or that way to check a range of names for i in range(len(dir())): if f'host{i}' in…

python variables exec
answered 4 years, 6 months ago lmonferrari 3,550
1
votes

4
answers

241
views

A: Filter list of python objects

You can use the lambda expression and test if the ID is 1 lista = [{'ID': 1, 'Name': 'Teste 1' }, {'ID': 2, 'Name': 'Teste 2' }] filtrado = filter(lambda x: x['ID'] == 1, lista) Exit [f for f in…

python list
answered 4 years, 6 months ago lmonferrari 3,550
2
votes

2
answers

63
views

A: Is there any way pd. Grouper, how much used for time frequencies, adds lines even when there are no records in a time interval?

You can use the function asfreq of pandas import pandas as pd df = pd.read_csv('./Antes do Agrupamento.csv', parse_dates=['Data']) df_agregado = df.groupby(['Numero Agrupado', pd.Grouper(key='Data',…

python pandas group-by
answered 4 years, 6 months ago lmonferrari 3,550
2
votes

2
answers

97
views

A: How to filter rows where columns meet consecutive conditions in Python?

You can use isin by creating a list of possible combinations. vl = ['LA','IA','LS','IS'] dados['promo'] = (dados.shift(axis = 1) + dados).isin(vl).any(axis = 1).astype(int) dados.shift 'move' the…

python function pandas
answered 4 years, 6 months ago lmonferrari 3,550
3
votes

3
answers

84
views

A: Access values from a python json

If you want to work with json instead of using data.text utilize data.json import requests data = requests.get('https://proxycheck.io/v2/42.131.121.100?vpn=1&asn=1') j = data.json()…

python json
answered 4 years, 6 months ago lmonferrari 3,550
0
votes

2
answers

331
views

A: Problem of wget in python

One way to solve is by using requests import requests import time import re header = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko)'} for i in…

python wget
answered 4 years, 6 months ago lmonferrari 3,550
1
votes

2
answers

64
views

A: Export Summary() as data-frame

An alternative dados1 <- read.csv("dados-originais.csv", header = T, skip = 0, sep = ",") sum(is.na(dados1)) dados1_summary <- data.frame(sapply(na.omit(dados1), summary)) Loading the data…

r
answered 4 years, 6 months ago lmonferrari 3,550
0
votes

1
answer

18
views

A: How to separate in x and y being all but the last x and the last y?

You can pass the column names as a list and slice them: def split(dados): SEED = 42367 X = dados[dados.columns.to_list()[:-1]] y = dados[dados.columns.to_list()[-1:]] train_x, test_x, train_y,…

python function
answered 4 years, 6 months ago lmonferrari 3,550
0
votes

2
answers

49
views

A: How do you sum all the values of the vector in Python?

from functools import reduce valores = [] for i in range(1, 11): valores.append(int(input(f'Informe o {i}º valor: '))) An alternative way is to use the function reduce. It aggregates the values in a…

python
answered 4 years, 6 months ago lmonferrari 3,550
3
votes

1
answer

35
views

A: How to check the periodicity of a series in the R?

You can use the package Tsstudio with the function ts_info ts_info(sua_serie) Output example: The a series is a xts object with 1 variable and 15 observations Frequency: quarterly Start time:…

r date
answered 4 years, 6 months ago lmonferrari 3,550
0
votes

2
answers

93
views

A: Date change of a monthly average of Pandas

If the question is only to change the day without changing the average, you can use apply with the replace method: df['data'] = df['data'].apply(lambda d: d.replace(day = 15)) Entree: data media 0…

python python-3.x
answered 4 years, 6 months ago lmonferrari 3,550
1
votes

2
answers

312
views

A: How to fill a column of a DF Pandas using, as a comparison, a specific column between this and another DF?

Importing the pandas import pandas as pd Loading the test files df = pd.read_csv('./df.csv') df2 = pd.read_csv('./df2.csv', sep = ';') df CNPJ DATA codprojeto 0 123 2020-12-02 00:00:00 UTC 0 1 123…

python pandas
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

2
answers

82
views

A: Print letters that are outside of Collections. Counter

import string from collections import Counter Picking up the ascii characters asciiLetters = string.ascii_letters Creating a dictionary with key(letter) and value(0) dicionario = {key:0 for (key) in…

python python-3.x
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

2
answers

212
views

A: Create date variable

You can use the data_range of pandas: import pandas as pd from datetime import date data_atual = date.today() datas = pd.date_range(start = '11/05/2020' , end = data_atual, freq='D')[::-1] The…

python-3.x pandas
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

2
answers

231
views

A: download with requests and open python

Importing the required packages, please note that I have placed warnings as it shows a ssl error import requests import zipfile import io import warnings warnings.filterwarnings('ignore') File url…

python python-3.x
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

192
views

A: Python Api Data Filtering

Here you make the code of a json: dicionario = json.loads(requisicao.text) You can filter through the Keys of the dictionary: dicionario['USD']['name'] 'Dólar Comercial' dicionario['USD']['low']…

python api
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

35
views

A: How to change image name with python?

To do this you can use the code below: import os from shutil import move Defining the path and extension of files caminho = 'C:/caminho/para_as_imagens' ext = '.jpg' Taking the list of file names in…

python
answered 4 years, 7 months ago lmonferrari 3,550
5
votes

2
answers

108
views

A: Automate column subtraction in R

You can subtract the data frame by "moving" the "day/column" by making a Slice. Here we have the data frame (first slice) df[3:ncol(df)] `02/11/2020` `03/11/2020` `04/11/2020` `05/11/2020`…

r
answered 4 years, 7 months ago lmonferrari 3,550
5
votes

1
answer

245
views

A: Relative Frequency Table - R / R Studio (% Daily Sales Determined Date/Product)

Loading packages and xlsx: library(readxl) library(lubridate) df <- read_excel('./tempo_atendimento.xlsx') Making some conversions: df$COD_PRODUTO <- as.factor(df$COD_PRODUTO) df$RANGE_DIAS…

r rstudio dplyr tidyverse
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

1
answer

197
views

A: Numeric types to Aggregate error

Your error occurs because you are trying to aggregate a string as if it were a numeric variable. You must turn the money column into numerical value: dados['money'] =…

python pandas csv
answered 4 years, 7 months ago lmonferrari 3,550
4
votes

3
answers

95
views

A: Date sequence from a range in R

Using dplyr and lubridate you can use rowwise that enables you to work line by line: library(dplyr) library(lubridate) nova_base <- base %>% rowwise() %>% do(data.frame(ID = .$ID, DATA =…

r date
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

1
answer

306
views

A: Sqlite - Python insert data automatically

If using the pandas to store the database values, you can do as follows: Importing the necessary packages import pandas as pd import sqlite3 Simulating incoming data and creating a data frame…

python sqlite
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

1
answer

36
views

A: Doubt when counting table elements in Chr format

I believe you can solve using dplyr Count: flights %>% count(dest))

r
answered 4 years, 7 months ago lmonferrari 3,550
2
votes

2
answers

285
views

A: How to bring more fields in the Pandas groupby, without necessarily having to use them in the grouping?

One solution would be to create a 'filter': filtro = df.groupby('data')['contador'].max() And then use the isin of the pandas: df[df['contador'].isin(filtro)].reset_index(drop = True) Exit: data…

python group-by
answered 4 years, 7 months ago lmonferrari 3,550
2
votes

1
answer

116
views

A: Select python - Pandas

Importing the package import pandas as pd Loading the files: lojas = pd.read_excel('./lojas.xlsx') produtos = pd.read_excel('./produtos.xlsx') Using Jay of the Pandas: novo_df =…

python excel pandas
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

45
views

A: How to expand a dataframe based on a condition

importing the pandas package import pandas as pd Creating the data frame df = pd.DataFrame({ 'left_bound' : ['1', '4', '10', '25'], 'right_bound' : ['3', '9', '24', '50'], 'code' : ['a', 'b', 'c',…

python columns
answered 4 years, 7 months ago lmonferrari 3,550
5
votes

1
answer

45
views

A: Randomizing two sets of numbers, not repeating the values within each group (R)

Data frame.: df <- data.frame(ID = c(1, 1, 1, 3, 3, 3, 7, 7, 7)) Dyplr package: library(dplyr) Separating the problem into 2. Here we create set1 grouped by ID: df <- df %>% group_by(ID)…

r dplyr random-numbers
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

1
answer

195
views

A: Check if the content already exists in the text file

Sample tokens: tokens = [ 'Nzc3NzA2MTc2MzA4NzA3Mzc5.X7HVkg.v85rDccvWP-HJJxD_SMonOu', 'Nzc3NzA3ODI2Njg0MjMxNzEy.X7HXBw.CvxmjqeS8sW9Rx1sEy2ESLZ',…

python loop for
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

39
views

A: list with values greater than those reported by the user

Importing the package: import pandas as pd Loading the data and creating a new column: dados = dados = pd.read_csv('./DadosClimaticos2018Londrina.csv', sep = ';', parse_dates = ['Data'])…

list pandas
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

2
answers

150
views

A: Cross-reference two different dataframes with different line numbers

You can use replace with a dictionary. Importing the package: import pandas as pd Creating the first data frame: Grau_Instr_Bibl = {'Categoria': ['Analfabeto', 'Até 5ª Incompleto', '5ª Completo…

python pandas
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

1
answer

145
views

A: define average function with pandas

Importing the pandas: import pandas as pd Reading the data and storing in a variable parse_dates converts the Date column to datetime64 dados = pd.read_csv('./DadosClimaticos2018Londrina.csv', sep =…

pandas csv
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

323
views

A: Create dataframe pandas by dicionario 1 key and 1 value

If you want to use the values as index and the words as values you can use the following code: import pandas as pd pd.DataFrame(list(dic.keys()), index = dic.values()) In the first parameter we pass…

python pandas
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

74
views

A: How do I access elements of a json that contains multiple keys in python?

One of the ways is to pass the Keys(keys) explicitly: import requests requests= requests.get("https://api.hgbrasil.com/finance/quotations?key={}") dados = requests.json() moeda = input("Moeda…

python json array api
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

2
answers

141
views

A: Addition of columns in csv file - Python

You can do it this way: Creating the month column: dfdados['Mes'] = pd.DatetimeIndex(dfdados['Data']).month Saving the csv: dfdados.to_csv('./novo.csv',sep = ';', index = False) More about the…

python python-3.x pandas csv
answered 4 years, 7 months ago lmonferrari 3,550
3
votes

5
answers

1031
views

A: Check that all items in the list are equal

You can make a function that checks if the next number is equal to the current number, if different already returns as soon as the list is not fully equal(False), otherwise if the for is until the…

python list
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

2
answers

266
views

A: Receive two numbers, add the pairs and multiply the odd

User inserts start and end value: valor1 = int(input()) valor2 = int(input()) Stores the sum of pairs and the multiplication of impairments: par = 0 impar = 1 Here the loop for iterates over the…

python list for
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

2
answers

47
views

A: Increase the number of columns in the histogram

Utilize breaks: hist(histo$MCP, xlab= "MCP área (ha)", ylab = "Frequência", breaks = 5) I believe Bins is used in ggplot.

r
answered 4 years, 7 months ago lmonferrari 3,550
0
votes

1
answer

68
views

A: How do I get the input to be on the same line?

You can use map, split and multiple assignments. As shown in the example below: D, R, L, P, G = map(int, input().split()) Split: is a method that returns a list from a string. Map: executes a…

python python-3.x
answered 4 years, 7 months ago lmonferrari 3,550
1
votes

2
answers

97
views

A: How to access a list that is inside another list in Python?

You could otherwise store as dictionary for example, but follows a possible answer to list within list: for i, aluno in enumerate(listaAlunos[0]): print(aluno) print(f'nota 1 -…

python python-3.x
answered 4 years, 8 months ago lmonferrari 3,550
0
votes

1
answer

203
views

A: How to save requests return and save to . txt or . csv file in Python

If you want to save the file exactly as it is in the list: import io with io.open('arquivo.txt', "w", encoding="utf-8") as file: file.write(str(lista))

python csv txt request
answered 4 years, 8 months ago lmonferrari 3,550
2
votes

1
answer

1640
views

A: How can I make two graphs in the same Python Plot in Jupyter Notebook?

You can create the chart this way: import matplotlib.pyplot as plt from pandas import read_excel df = read_excel('./carmen.xlsx', names = ['A','B','C','D']) Defining what will be plotted:…

python plot graphics anaconda
answered 4 years, 8 months ago lmonferrari 3,550
4
votes

1
answer

62
views

A: Fill date.frame using for output

You can create the data frame this way, passing the range as value and as index: import pandas as pd x = pd.DataFrame({'v1':list(range(1,11))}, index = list(range(1,11))) x Exit: v1 1 1 2 2 3 3 4 4…

python r for
answered 4 years, 8 months ago lmonferrari 3,550
2
votes

4
answers

201
views

A: How to reduce Python code 3?

An alternative with Join: n = int(input()) if n % 2 == 0: n +=1 ' '.join(str(v) for v in range(n, n + 12, 2)) or n = int(input()) ' '.join(str(v) for v in range(n, n + 12) if v % 2 == 1)…

python algorithm
answered 4 years, 8 months ago lmonferrari 3,550
2
votes

1
answer

219
views

A: How to take only the time of a Timestamp

You can do with the time function: import pandas as pd a = pd.to_datetime(1490195805, unit='s') str(a.time()) Exit: '15:16:45'

python pandas datetime timestamp
answered 4 years, 8 months ago lmonferrari 3,550
1
votes

2
answers

258
views

A: How to divide the value of an element of a column by delimiter (p.e "|") in pandas?

One way to do this is by using the function explode of own Pandas: df = pd.DataFrame(df['coluna1'].str.split('|').explode().reset_index(drop = True)) Entree: coluna1 0 ola|52 1 hey 2 sou 3 ja 4 da|5…

python string pandas
answered 4 years, 8 months ago lmonferrari 3,550
0
votes

3
answers

1399
views

A: Python Range - Numbers in descending order

The way you built the algorithm you need to get 2 from the start of the range(start, stop, step): numero = int(input('Informe um número inteiro positivo e par: ')) while numero <= 0 or numero % 2…

python
answered 4 years, 8 months ago lmonferrari 3,550
7
votes

3
answers

71
views

A: Conditional column based on multiple dplyr lines

I used the group_by + mutate + case_when + all to verify that all occurrences of the determined id were yes/no and those mixed would be missing values and filled with nd. library(dplyr) df %>%…

r dplyr
answered 4 years, 8 months ago lmonferrari 3,550