2
Dear users, I have a large file with the following columns
chr10_46938 EXON=28/28 STRAND=-1 ENSP=ENSGALP00000004070 SIFT=tolerated(0.38)
chr10_46966 EXON=28/28 STRAND=-1 DOMAINS=Low_complexity_(Seg):Seg SIFT=tolerated(0.66) ENSP=ENSGALP00000004070
chr10_46987 EXON=28/28 STRAND=-1 SIFT=tolerated(0.93) ENSP=ENSGALP00000004070
chr10_47071 ENSP=ENSGALP00000004070 SIFT=tolerated(0.97) EXON=28/28 STRAND=-1
chr10_47164 EXON=28/28 STRAND=-1 DOMAINS=Low_complexity_(Seg):Seg SIFT=tolerated(0.37) ENSP=ENSGALP00000004070
chr10_47466 ENSP=ENSGALP00000004070 SIFT=tolerated(0.11) STRAND=-1 EXON=28/28 DOMAINS=PROSITE_profiles:PS50196,Pfam_domain:SSF50729
I want to select only the first column and the information SIFT=tolerated(..)
, but this is not found in fixed columns, example column 2. How to select only this information I wish to have for example the following output:
chr10_46938 SIFT=tolerated(0.38)
chr10_46966 SIFT=tolerated(0.66)
chr10_46987 SIFT=tolerated(0.93)
chr10_47071 SIFT=tolerated(0.97)
chr10_47094 SIFT=tolerated(1)
chr10_47164 SIFT=tolerated(0.37)
chr10_47466 SIFT=tolerated(0.11)
What command to use on UNIX to get this list?
You can use the
awk
orcut
.– stderr
I tried several mutlei commands and it didn’t work...if you can be more specific....
– Alex