Measured data in a 5min range - What language to use. Prog?

Asked

Viewed 67 times

-1

Working with automatic measurements every 5 min in a database, as illustrated:

estrutura dados no excel

I usually use Excel for analysis (sum, mean, distribution, etc.). However, due to the large volume of data, about 500 thousand lines, excel does not account for the processing. I would like to develop an application that only does the analysis I need, but I don’t know what would be the most correct way to store the data (csv, txt, mysql, etc.) or what is the most suitable programming language for data processing of this type.

I’d like your opinion.

  • I understand well that at this time still has no database, only excel? Where do the data/ measurements come from? what kind of accounts do you want to do?

  • Only excel and files . csv . This data comes from dataloggers that monitor solar power systems and weather stations.

  • And how do the data get to the computer? via internet, file, or handwritten?

  • A file for each month, where each file has a file for each day, for example: C:/January/ has the archives: 01-01-2014.csv, 02-01-2014.csv, ...

  • R is an interesting language to work with data analysis.

  • Don’t invent the wheel, there are statistical software like SAS ($) and R (free).

  • Thank you very much, guys, I’m going to do a little research on the options you’ve provided.

Show 2 more comments

1 answer

0


I work with CSV’s that have large sizes, such as 10Gb, 12Gb and so on. Basically I use Python with some libraries that help me in performance and facilitate some operations, take a look:

Pandas: http://pandas.pydata.org/ Numpy: http://www.numpy.org/

  • Is it possible to create applications like c# and VB with Python? Grateful.

  • No. They’re very different things.

Browser other questions tagged

You are not signed in. Login or sign up in order to post.