ZIP文件格式是一种常见的存档和压缩标准,这个zipfile模块提供了工具来创建、读取、写入、附加和列出一个ZIP文件。使用ZIP64扩展(即压缩文件大小超过4G),它能解压加密的ZIP文件,解密过程很慢。
1、测试是否为ZIP文件
is_zipfile()函数会返回一个布尔值来表示是否为ZIP文件,代码如下:
#!/usr/bin/python import zipfile for filename in ['print_name.py', 'python.zip', 'uwsgi', 'admin']: print '%20s %s' % (filename, zipfile.is_zipfile(filename))
如果文件不存在或者不是ZIP文件会返回False。
[root@www home]# python zipfile_is_zipfile.py print_name.py False python.zip True uwsgi False admin False
2、读取ZIP文件的内容
#!/usr/bin/env python import zipfile zf = zipfile.ZipFile('python.zip', 'r') print zf.namelist()
使用namelist() 函数,返回结果是一个列表
[root@www home]# python zipfile_namelist.py ['test.txt']
#这只能查看ZIP文件的部分内容,使用infolist() 或者 getinfo() 可以从ZIP文件中获取更多信息,代码如下:
import datetime import zipfile def print_info(archive_name): zf = zipfile.ZipFile(archive_name) for info in zf.infolist(): print info.filename print '\tComment:\t', info.comment print '\tModified:\t', datetime.datetime(*info.date_time) print '\tSystem:\t\t', info.create_system, '(0 = Windows, 3 = Unix)' print '\tZIP version:\t', info.create_version print '\tCompressed:\t', info.compress_size, 'bytes' print '\tUncompressed:\t', info.file_size, 'bytes' print if __name__ == '__main__': print_info('python.zip')
执行上面代码显示如下结果:
[root@www home]# python zipfile_infolist.py test.txt Comment: Modified: 2013-09-06 20:09:58 System: 3 (0 = Windows, 3 = Unix) ZIP version: 20 Compressed: 419430400 bytes Uncompressed: 419430400 bytes
使用getinfo()函数可以在ZIP文件内查找内容,代码如下:
#!/usr/bin/env python import zipfile zf = zipfile.ZipFile('python.zip') for filename in ['test.txt', 'notthere.txt']: try: info = zf.getinfo(filename) except KeyError: print 'ERROR: Did not find %s in zip file' % filename else: print "%s is %d bytes" % (info.filename, info.file_size)
如果需要查找的不存在ZIP文档里,会返回一个KeyError错误。
[root@www home]# python zipfile_getinfo.py test.txt is 419430400 bytes ERROR: Did not find notthere.txt in zip file
3、从一个ZIP文档中提取文件
#!/usr/bin/env python import zipfile zf = zipfile.ZipFile('python.zip') for filename in ['test.txt', 'notihere.txt']: try: data = zf.read(filename) except KeyError: print 'ERROR: Did not find %s in zip file' % filename else: print filename, ':' print repr(data) print
要提取的文件会被自动解压:
[root@www home]# python zipfile_read.py README.txt : 'The examples for the zipfile module use this file and example.zip as data.\n' ERROR: Did not find notthere.txt in zip file
4、创建一个新的ZIP文件
创建新的ZIP归档文件代码如下:
#!/usr/bin/env python from zipfile_infolist import print_info import zipfile print 'creating archive' zf = zipfile.ZipFile('zipfile_write.zip', 'w') try: print 'adding text.txt' zf.write('text.txt') finally: print 'closing' zf.close() print print_info('zipfile_write.zip')
默认情况下不会对文件进行压缩:
[root@www home]# python zipfile_write.py creating archive adding text.txt closing text.txt Comment: Modified: 2013-09-06 20:39:52 System: 3 (0 = Windows, 3 = Unix) ZIP version: 20 Compressed: 104857600 bytes Uncompressed: 104857600 bytes
如果要对文件进行压缩,zlib模块是必须的,如果zlib可以使用,你可以使用zipfile.ZIP_DEFLATED设置压缩模式为单个文件或者归档一个整体。默认的压缩模式是zipfile.ZIP_STORED。
#!/usr/bin/env python from zipfile_infolist import print_info import zipfile try: import zlib compression = zipfile.ZIP_DEFLATED except: compression = zipfile.ZIP_STORED modes = { zipfile.ZIP_DEFLATED: 'deflated', zipfile.ZIP_STORED: 'stored', } print 'creating archive' zf = zipfile.ZipFile('zipfile_write_compression.zip', mode='w') try: print 'adding text.txt with compression mode', modes[compression] zf.write('text.txt', compress_type=compression) finally: print 'closing' zf.close() print print_info('zipfile_write_compression.zip')
这次文件被压缩:
[root@www home]# python zipfile_write_compression.py creating archive adding text.txt with compression mode deflated closing text.txt Comment: Modified: 2013-09-06 20:39:52 System: 3 (0 = Windows, 3 = Unix) ZIP version: 20 Compressed: 101923 bytes Uncompressed: 104857600 bytes
替换原始文件名:
#!/usr/bin/env python from zipfile_infolist import print_info import zipfile zf = zipfile.ZipFile('zipfile_write_arcname.zip', mode='w') try: zf.write('text.txt', arcname='NOT_README.txt') finally: zf.close() print_info('zipfile_write_arcname.zip')
结果显示原始文件名已经被替换了:
[root@www home]# python zipfile_write_arcname.py NOT_README.txt Comment: Modified: 2013-09-06 20:39:52 System: 3 (0 = Windows, 3 = Unix) ZIP version: 20 Compressed: 104857600 bytes Uncompressed: 104857600 bytes
更多博文请移步:linux开源技术博客 http://www.chlinux.net