In Praise of Serialization via __repr__ (Revised)
The problem is this: parsing HTML from a variety of sources to create pleasant, easy-to-manipulate spreadsheets. HTML can represent the same data in a variety of ways, even within a single web site. What we have are several layers of parsing; layers which must be linked with increasingly simplified intermediate …
more ...