ChangeLog


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246

2009-09-25  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/__init__.py: bump version to 3.9.1

2009-09-25  Seth Vidal <skvidal@fedoraproject.org>

	* makefile: clean up everything in make clean

2009-09-25  Seth Vidal <skvidal@fedoraproject.org>

	* test/runtests.py, test/test_grabber.py, test/test_keepalive.py,
	urlgrabber/__init__.py, urlgrabber/byterange.py,
	urlgrabber/grabber.py, urlgrabber/keepalive.py,
	urlgrabber/mirror.py, urlgrabber/progress.py,
	urlgrabber/sslfactory.py: cleanup all the old urlgrabber urllib code
	that's not being used delete sslfactory and keepalive fix up the
	unittests to match the existing code

2009-09-24  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: update documentation for ssl options and
	size/max_header_size options

2009-09-23  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - fix the reget testcases (and regets in general) with the max size
	check - make the errorcode more obvious when we go over the range -
	obviously don't do the check if all of our max values are None (or
	even 0 since that is a silly number for a Max)

2009-09-22  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: handle endless-data problems safely:  "A
	malicious server could cause libcurl to download an infinite amount
	of data, potentially causing all of memory or disk to be filled.
	Setting the CURLOPT_MAXFILESIZE_LARGE option is not sufficient to
	guard against this.  Instead, the app should monitor the amount of
	data received within the write or progress callback and abort once
	the limit is reached."  had to restructure a good bit of the error
	handling to do this but it works for both endless headers and
	endless content.

2009-09-21  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: make sure the value we get back from the
	parse150 and other calls is converted to an int before we make it
	'size' rhbug: #524705

2009-09-02  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: make file:// url not found msgs clearer and
	hopefully fix a couple of ctrl-c issues.

2009-08-27  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: make proxy=_none_ properly disable all
	proxies as per the docs

2009-08-14  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - add full contingent of ssl options:   - client keys   - client
	certs   - capath/cainfo   - client key passwords   - client key and
	cert types   - verifypeer/verifyhost - add a number of common errors
	to do_perform() - when an error is unknown, and doesn't make sense
	report complete pycurl error code - when the filename is '' and not
	None and we're doing a urlgrab() try to open the   file anyway
	rather than silently swallowing the data into a StringIO and
	discarding it.

2009-08-13  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: add _to_utf8() method to pycurlfileobject
	make sure postfield data is to_utf8'd before setting the option
	otherwise pycurl is unhappy if the postfield data is a unicode
	object instead of a string object. closes rh bug
	https://bugzilla.redhat.com/show_bug.cgi?id=515797

2009-08-12  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: initial pass at setting more advanced ssl
	options. verify peer and verify host work as expected.

2009-08-07  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: keep from making tmpfiles all over /tmp on
	any local file:// urlopen() by doing it in StringIO instead of
	mkstemp().  Sort of fixes
	https://bugzilla.redhat.com/show_bug.cgi?id=516178

2009-08-06  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - fix intrrupt handler and document why keyboardinterrupt is going
	to be so weird in pycurl - disable signals and make sure we don't
	handle/intercept any in the pycurl code. - set 'check_timestamp'
	regets as NotImplemented. The work around is multiple connections.
	it is possible but not immediately useful since, afaict, NOTHING
	uses the check_timestamp regets.

2009-08-05  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - make sure regets work when our filename is unicode - make sure we
	are not resetting self.append = False when we don't need to

2009-08-05  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - make sure we tell pycurl to get the filetime when downloading -
	set a couple of options as 'True/False' instead of 1,0 - for
	readability - make sure the option passed to timeout is an int - not
	a string

2009-08-04  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: missed setting the value from opts.timeout
	- doesn't really HURT what will happen b/c if your connect takes
	longer than 5minutes then you're SCREWED

2009-08-04  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: handle timeouts more correctly (with the
	exception) and set timeouts to be connect timeouts since libcurl
	seems to actually honor timeouts - as opposed to urllib. closes rh
	bug # 515497

2009-07-31  Seth Vidal <skvidal@fedoraproject.org>

	* ChangeLog, makefile, urlgrabber/__init__.py: changelog + release
	date touchup

2009-07-31  Seth Vidal <skvidal@fedoraproject.org>

	* makefile: add a few more things to be cleaned out

2009-07-31  Seth Vidal <skvidal@fedoraproject.org>

	* ChangeLog: update changelog

2009-07-31  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - make readlines() work for mirrorlists in yum (which probably
	shouldn't be using it anyway) - do a do_grab() in _do_open() which
	may or may not be a good idea - I could also make the _do_grab()
	happen when someone attempts to hit a method beyond the file object
	open

2009-07-30  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - make basic posts work

2009-07-30  Seth Vidal <skvidal@fedoraproject.org>

	* maint/git2cl: add git2cl

2009-07-30  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: when I first started this I hacked
	something into URLGrabberFileObject - this reverts that hack

2009-07-30  Seth Vidal <skvidal@fedoraproject.org>

	* ChangeLog, maint/cvs2cl.pl, maint/usermap, test/runtests.py,
	urlgrabber/__init__.py: - clean up some unused files - update the changelog - bump the
	version - update the copyright in a couple of places

2009-07-30  Seth Vidal <skvidal@fedoraproject.org>

	* MANIFEST.in, makefile: - make makefile work again without using cvs - add makefile to
	MANIFEST.in

2009-07-30  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - make simple/most proxies work - remove unnnecessary 'have_range'
	check for pycyurl obj

2009-07-29  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: - add range support - get rid of the .part file thing - it makes
	range-regets harder than they need to be - make sure regets behave

2009-07-29  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: implement throttle/bandwidth controls in
	pycurl tested with the progress call back - seems to work very well


2009-07-29  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: get the content-length/size for ftp pkgs
	too - steals parse150 from ftplib. Should work for A LOT of ftp
	servers, but not all of them - add self.scheme for which protocol
	we're using here.

2009-07-29  James Antill <james@and.org>

	* urlgrabber/byterange.py: Import fix for ftp ports in old urilib
	code (probably worthless now, but meh)

2009-07-29  James Antill <james@and.org>

	* urlgrabber/progress.py: Import progress patches from Fedora.
	These were done over a couple of years:    . cleanup UI.   . dynamic
	terminal widths.     . deal with serial console.   . total download
	stuff.

2009-07-28  Seth Vidal <skvidal@fedoraproject.org>

	* test/runtests.py, urlgrabber/grabber.py: implement
	PyCurlFileObject. This makes the default and forklifts all the code
	to pycurl. This is not finished but is functional for a significant
	number of the tests. things known to be broken: - proxies - http
	POST - non-header-based byte-ranges - certain types of read
	operations when downloading a file to memory instead of to a
	filename

2009-05-15  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: make it use *args instead of silly if
	statements

2009-05-15  Seth Vidal <skvidal@fedoraproject.org>

	* urlgrabber/grabber.py: modify urlgraberror so it has a url
	attribute and includes the url in all error messages.

2006-12-12  mstenner <mstenner>

	* urlgrabber/grabber.py: more debugging code to expose options

2006-12-08  mstenner <mstenner>

	* scripts/urlgrabber, test/test_grabber.py, urlgrabber/grabber.py,
	urlgrabber/keepalive.py: lots of changes... improved clarity of
	cached objects, improved debugging and logging, more options to the
	urlgrabber script.

2006-12-07  mstenner <mstenner>

	* scripts/urlgrabber, urlgrabber/grabber.py: Minor doc updates and
	error handling in grabber.py.  Complete rewrite of the urlgrabber
	script.

2006-12-05  mstenner <mstenner>

	* Minor fix to make byteranges work with some servers.  _do_grab now
	only reads as much as it needs to, rather than reading until the
	server sends EOF.