Board logo

标题: [求助]用gawk按列求和 [打印本页]

作者: hgx126     时间: 2011-1-3 01:46    标题: [求助]用gawk按列求和

一文本,有上万行,内容类似如下.
CODE:  [Copy to clipboard]
序号,机构号,金额,姓名,客户号,五级
1,5416,350000,熊清喜,48260000140870,次级
2,5416,50000,杨小平,48260000142862,次级
3,5416,4500,杨明成,48260000144168,次级
4,5417,7000,蒲猛,48260000144266,次级
5,5418,10000,杨小平,48260000142862,可疑
6,5417,5000,张瑞东,48260000145752,可疑
7,5417,500000,杨明成,48260000144168,可疑
8,5419,200000,张瑞东,48260000145752,损失
9,5417,25000,杨小平,48260000142862,正常
10,5417,15000,蒲猛,48260000144266,正常
我需要将同一客户号(第五列)的金额(第3列)相加,并插入到客户号列后。

处理后如下:
CODE:  [Copy to clipboard]
序号,机构号,金额,姓名,客户号,余额,五级
1,5416,350000,熊清喜,48260000140870,350000,次级
2,5416,50000,杨小平,48260000142862,85000,次级
3,5416,4500,杨明成,48260000144168,504500,次级
4,5417,7000,蒲猛,48260000144266,22000,次级
5,5418,10000,杨小平,48260000142862,85000,可疑
6,5417,5000,张瑞东,48260000145752,205000,可疑
7,5417,500000,杨明成,48260000144168,504500,可疑
8,5419,200000,张瑞东,48260000145752,205000,损失
9,5417,25000,杨小平,48260000142862,85000,正常
10,5417,15000,蒲猛,48260000144266,22000,正常
因为该文本较大(20M以上),要求用gawk或其它能快速处理文本的命令处理。
求高手相助。先谢了。
作者: dooomer     时间: 2011-1-6 08:24
用excel就可以了吧。gawk就不知道怎么处理了。
作者: freeants001     时间: 2011-1-6 16:41
CODE:  [Copy to clipboard]
BEGIN{
        FS=",";OFS=",";
        while(getline<ARGV[1]>0)arr[$4]+=$3;       
}
{
        print$1,$2,$3,$4,$5,(NR>1?arr[$4]:"余额"),$6;
}
[ Last edited by freeants001 on 2011-1-6 at 16:53 ]




欢迎光临 中国DOS联盟论坛 (http://cndos.fam.cx/forum/) Powered by Discuz! 2.5