What Is The Most Pythonic Way To Interleave Text File Contents?
Python question: If I have a list of files, how do I print line #1 from each file, then line #2, etc.? (I'm a Python newbie, obviously...) Example: file1: foo1 bar1 file2: foo2 ba
Solution 1:
forlinesin itertools.izip(*file_handlers):
sys.stdout.write(''.join(lines))
Solution 2:
> cat foo
foo 1
foo 2
foo 3
foo 4
> cat bar
bar 1
bar 2
> cat interleave.py
from itertools import izip_longest
from contextlib import nested
with nested(open('foo'), open('bar')) as (foo, bar):
for line in (line for pair in izip_longest(foo, bar)
for line in pair if line):
print line.strip()
> python interleave.py
foo 1
bar 1
foo 2
bar 2
foo 3
foo 4
compared to other answers here:
- files are closed on exit
- izip_longest doesn't stop when one file stops
- efficient use of memory
or, for multiple files (filenames
is a list of files):
with nested(*(open(file) for file in filenames)) as handles:
for line in (line fortuplein izip_longest(*handles)
for line intupleif line):
print line.strip()
Solution 3:
If all your files have the same number of lines, or if you want to stop as soon as any file is exhausted, Ignacio's answer is perfect. If you want to support files of different lengths, though, you should use the "round robin" recipe from the itertools
documentation:
defroundrobin(*iterables):
"roundrobin('ABC', 'D', 'EF') --> A D E B F C"# Recipe credited to George Sakkis
pending = len(iterables)
nexts = cycle(iter(it).nextfor it in iterables)
while pending:
try:
fornextin nexts:
yieldnext()
except StopIteration:
pending -= 1
nexts = cycle(islice(nexts, pending))
sys.stdout.writelines(roundrobin(*file_handlers))
Post a Comment for "What Is The Most Pythonic Way To Interleave Text File Contents?"