Search in text GIANT

Asked

Viewed 42 times

0

I have here with me, a giant text, AMAZING, is a txt of approximately 30 GIGABYTES, it is compressed in 7z I want to ask for your help, to know how I can do research within it, like a cat | grep, which obviously wouldn’t work by file size since I have only 2GB of RAM, I want to try to do research on its content, without my machine crashing, preferably without having to extract

  • Addendum: 7z is 1.7 GB, 28 GB is unpacked

  • cat and grep would work yes, these commands do not retain the entire file in memory, but just enough to do the search. If you’re looking for a 5-letter word, grep would only keep 5 characters in the buffer.

  • 1

    Ever tried to use sed?

  • @epx and how I could use cat | grep without having to extract the file, there is a way?

  • @Danizavtz not tried yet, I will search

  • Use Sed or AWK, as it does not send the whole file to memory to try to be processed.

  • A "detail" that may not necessarily be needed here is to remember cloud computing. You can use Saas, pass or iass as you need. If you don’t have enough computational power or memory to run something specific, you can quickly raise a virtual machine, run the process and take down the VM next.

  • The parameters -so-bd pro 7z I think solve

  • the file after unpacking is a single text file?? If not indiscreet, where did this file come from?

  • It was that bullshit with Rasa, I got worried and decided to look for things there, I’m glad that there was nothing leaked

Show 5 more comments
No answers

Browser other questions tagged

You are not signed in. Login or sign up in order to post.