Dear users, I have a large file with the following columns:
chr10_46938     EXON=28/28      STRAND=-1       ENSP=ENSGALP00000004070 SIFT=tolerated(0.38) 
chr10_46966     EXON=28/28      STRAND=-1       DOMAINS=Low_complexity_(Seg):Seg        SIFT=tolerated(0.66)    ENSP=ENSGALP00000004070   
chr10_46987     EXON=28/28      STRAND=-1       SIFT=tolerated(0.93)    ENSP=ENSGALP00000004070
chr10_47071     ENSP=ENSGALP00000004070 SIFT=tolerated(0.97)    EXON=28/28      STRAND=-1
chr10_47164     EXON=28/28      STRAND=-1       DOMAINS=Low_complexity_(Seg):Seg        SIFT=tolerated(0.37)    ENSP=ENSGALP00000004070
chr10_47466     ENSP=ENSGALP00000004070 SIFT=tolerated(0.11)    STRAND=-1       EXON=28/28    DOMAINS=PROSITE_profiles:PS50196,Pfam_domain:SSF50729
I want to select only the first column and the SIFT=tolerated(..) field, but that field is not in a fixed column (it is not always column 2, for example). How can I select only this information? For example, I would like the following output:
chr10_46938     SIFT=tolerated(0.38)  
chr10_46966     SIFT=tolerated(0.66)   
chr10_46987     SIFT=tolerated(0.93)  
chr10_47071     SIFT=tolerated(0.97)  
chr10_47094     SIFT=tolerated(1)            
chr10_47164     SIFT=tolerated(0.37)    
chr10_47466     SIFT=tolerated(0.11)
What command can I use on UNIX to get this list?
You can use awk or cut.
– stderr
I tried several commands and none of them worked... could you be more specific?
– Alex
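To be more specific, here is a minimal awk sketch. Since the SIFT field moves around, cut alone will not do it, because cut only picks fixed field positions. Instead, awk can scan every field on each line and print the first column together with whichever field starts with SIFT=. This assumes the fields are separated by whitespace (tabs or spaces) and that each line has at most one SIFT= field; the function name extract_sift is just a hypothetical wrapper for illustration.

```shell
# Hypothetical helper: print column 1 plus the SIFT=... field, wherever it is.
# Assumes whitespace-separated fields and at most one SIFT= field per line.
# Lines without a SIFT= field are skipped.
extract_sift() {
  awk '{
    for (i = 2; i <= NF; i++)           # scan every field after the first
      if ($i ~ /^SIFT=/) {              # field begins with "SIFT="
        print $1 "\t" $i                # first column, tab, SIFT field
        break                           # stop at the first match
      }
  }' "$@"
}

# Small demo on two lines taken from the question
extract_sift <<'EOF'
chr10_46938     EXON=28/28      STRAND=-1       ENSP=ENSGALP00000004070 SIFT=tolerated(0.38)
chr10_47071     ENSP=ENSGALP00000004070 SIFT=tolerated(0.97)    EXON=28/28      STRAND=-1
EOF
```

On a real file you would run something like extract_sift input.txt > output.txt, where input.txt stands for your own file name. To match only the tolerated calls, the pattern could be tightened to /^SIFT=tolerated/.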