Changeset 3047
- Timestamp: Jan 3, 2013, 9:17:20 PM (8 years ago)
- File: 1 edited
cpc/trunk/project/batch/questions/parse.py
--- r2997
+++ r3047
@@ -48,8 +48,9 @@
 
 lastbr_re = re.compile('\s*<br\s*/?>$', re.U|re.M)
+linebreaks_re = re.compile(r'[\s\r\n]+')
 def extracttext(t):
     div = t.parent.findNextSibling('div', attrs={'class': 'contenutexte'})
     text = div.decodeContents().strip()
-    return lastbr_re.sub('', text)
+    return linebreaks_re.sub(' ', lastbr_re.sub('', text))
 
 def extractspan(t):
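
The effect of the new return line can be checked on a plain string, without the HTML parser. The sketch below is only an illustration: `clean` is a hypothetical helper name standing in for the string-level part of `extracttext`, the sample text is made up, and `lastbr_re` is written as a raw string here, which does not change the pattern.

```python
import re

# Same two patterns as in the changeset.
lastbr_re = re.compile(r'\s*<br\s*/?>$', re.U | re.M)
linebreaks_re = re.compile(r'[\s\r\n]+')


def clean(text):
    """Hypothetical helper: the string-level part of extracttext().

    First strip a <br> that ends a line, then collapse every run of
    whitespace (spaces, tabs, CR, LF) into a single space.
    """
    return linebreaks_re.sub(' ', lastbr_re.sub('', text))


# Made-up sample input, not taken from the project data.
sample = "Premiere ligne\r\n   deuxieme ligne<br />"
result = clean(sample)
print(result)
```

Since `\s` already matches `\r` and `\n` for string patterns, `[\s\r\n]+` behaves the same as `\s+`; the explicit characters in the changeset only make the intent visible.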