@@ -15,8 +15,9 @@ static websites, blogs, and wikis and excludes most social media sites.
Download ⬇︎
--------
- * [Mac OS X 10.7 (Lion)](https://github.com/downloads/davidfstr/Crystal-Web-Archiver/crystal-mac-1.0.dmg)
- * [Windows XP and later](https://github.com/downloads/davidfstr/Crystal-Web-Archiver/crystal-win-1.0.exe)
+
+ * [macOS 10.14 and later](https://github.com/davidfstr/Crystal-Web-Archiver/releases/download/v1.1.0b/crystal-mac-1.1.0b.dmg)
+ * [Windows 7, 8, 10](https://github.com/davidfstr/Crystal-Web-Archiver/releases/download/v1.1.0b/crystal-win-1.1.0b.exe)

Quickstart ⭐
@@ -124,3 +125,96 @@ Related Projects ⎋
easy for automated crawlers (rather than for humans) to download websites.

[webcrystal]: http://dafoster.net/projects/webcrystal/
+
+
+ Release Notes ⋮
+ -------------
+
+ ### Future
+
+ * See the [Roadmap](https://github.com/davidfstr/Crystal-Web-Archiver/wiki/Roadmap).
+
+ ### v1.1.0b <small>(March 22, 2021)</small>
+
+ Our first beta release brings support for downloading more complex static sites,
+ recognizing vastly more link types than ever before. It also supports various
+ kinds of *dynamic* link-rewriting (🧠), beyond the usual static link-rewriting.
+
+ Additionally, the code has been modernized to work properly on the latest
+ operating systems and to use newer versions of the BeautifulSoup parser and
+ the wxWidgets UI library. Unfortunately, this has meant dropping support for
+ some older macOS versions and Windows XP.
+
+ * Parsing improvements
+     * Recognize `url(*)` and `url("*")` references inside CSS!
+     * Recognize http(s):// references inside `<script>` tags! 🧠
+     * Recognize http(s):// references inside custom and unknown attribute types! 🧠
+     * Recognize many more link types:
+         * Recognize `<* background=*>` links
+         * Recognize favicon links
+     * Fix scoping issue that made detection of *multiple* links of the format
+       `<input type='button' onclick='*.location = "*";'>` unreliable.
+     * Fix Content-Type and Location headers to be recognized in case-insensitive fashion,
+       fixing redirects and encoding issues on many archived sites.
+     * Support rudimentary parsing of pages containing frames (and `<frameset>` tags),
+       with a new "basic" parser that can be used instead of the "soup" parser.
+     * Fix infinite recursion if a resource identifies itself as a self-embedded resource.
+
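As an illustration of the `url(*)` and `url("*")` recognition listed under parsing improvements, here is a minimal sketch of how such references could be pulled out of a stylesheet with a regular expression. The function name `find_css_urls` and the pattern are assumptions made for this example; they are not taken from Crystal's source, and a real CSS parser would need to handle escapes and comments as well.

```python
import re
from typing import List

# Hypothetical pattern: url(...) with a double-quoted, single-quoted,
# or unquoted target, allowing whitespace inside the parentheses.
_CSS_URL_RE = re.compile(
    r"""url\(\s*(?:"([^"]*)"|'([^']*)'|([^)\s]*))\s*\)""",
    re.IGNORECASE,
)


def find_css_urls(css_text: str) -> List[str]:
    """Return the non-empty url(...) targets found in css_text, in source order."""
    urls = []
    for match in _CSS_URL_RE.finditer(css_text):
        # Exactly one of the three alternatives captured something.
        target = next(g for g in match.groups() if g is not None)
        if target:
            urls.append(target)
    return urls
```

Each returned string is still relative to the stylesheet's own URL, so a caller would typically resolve it with `urllib.parse.urljoin` before downloading or rewriting it.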
+ * Downloading improvements
+     * Save download errors in archive more reliably
+
+ * Serving & link-rewriting improvements
+     * Dynamically rewrite incoming links from unparseable site-relative and
+       protocol-relative URLs in archived resource revisions! 🧠
+         * This required altering the request URL format to be more distinct: **(Breaking Change)**
+             * Old format: `http://localhost:2797/http/www.example.com/index.html`
+             * New format: `http://localhost:2797/_/http/www.example.com/index.html`
+     * Dynamically download accessed resources that are members of an existing
+       resource group. 🧠
+         * This allows many unparseable resource-relative URLs in archived
+           resources to be recognized and downloaded successfully.
+     * Better header processing:
+         * Recognize many more headers:
+             * Recognize standard headers related to CORS, Timing, Cookies,
+               HTTPS & Certificates, Logging, Referer, Protocol Upgrades,
+               and X-RateLimit.
+             * Recognize vendor-specific headers from AWS CloudFront,
+               Cloudflare, Fastly, and Google Cloud.
+         * Match headers against the header whitelist and blacklist in case-insensitive fashion,
+           allowing more headers to be served correctly and reducing unknown-header warnings.
+     * Fix crash when viewing a resource in the archive that was fetched with an error;
+       an appropriate error page is now served instead.
+     * Fix transformed HTML and CSS documents to be reported as charset=utf-8 correctly.
+     * Automatically fix up URLs lacking a path to have a / path.
+     * Don't attempt to rewrite mailto or javascript URLs.
+     * Don't print error if browser drops connection early.
+     * Avoid printing binary data to console when handling incoming binary protocol message.
+         * This can happen if archived JavaScript attempts to force fetching an
+           archived resource over HTTPS from an http:// URL.
+     * Colorize logged output by default. 🎨
+
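The new request URL format and a few of the rewriting rules listed under serving improvements (a `/` path for URLs lacking one, no rewriting of mailto or javascript URLs) can be sketched together. This is only an illustration under stated assumptions: `to_request_url` and `ARCHIVE_SERVER` are hypothetical names, the port comes from the example URLs in the notes, and Crystal's actual rewriting logic may differ.

```python
from urllib.parse import urlparse

# Assumed server address, taken from the example URLs in the release notes.
ARCHIVE_SERVER = "http://localhost:2797"


def to_request_url(archive_url: str, server: str = ARCHIVE_SERVER) -> str:
    """Map an original http(s) URL to a '/_/'-prefixed request URL.

    URLs with any other scheme (mailto:, javascript:, ...) are returned
    unchanged, since they do not point at an archived resource.
    """
    parts = urlparse(archive_url)
    if parts.scheme not in ("http", "https"):
        return archive_url
    path = parts.path or "/"  # fix up URLs that lack a path
    rewritten = f"{server}/_/{parts.scheme}/{parts.netloc}{path}"
    if parts.query:
        rewritten += "?" + parts.query
    if parts.fragment:
        rewritten += "#" + parts.fragment
    return rewritten
```

For example, `to_request_url("http://www.example.com/index.html")` yields the "New format" URL shown in the notes. The `/_/` prefix keeps request URLs for archived resources distinct from other paths on the same server.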
+ * Modernize codebase
+     * Upgrade Python 2.7 -> 3.8
+     * Upgrade wxPython 2.x -> 4
+     * Upgrade BeautifulSoup 2.x -> 4
+     * Track and pin dependencies with Poetry
+     * Change supported operating system versions **(Breaking Change)**
+         * Drop support for Windows XP. Only Windows 7, 8, and 10 are now supported.
+         * Drop support for Mac OS X 10.7 - 10.13. Only macOS 10.14+ is now supported.
+
+ * Miscellaneous
+     * User-Agent: Alter to advertise correct version and project URL.
+     * Logging changes:
+         * Mac: Redirect stdout and stderr to file when running as binary.
+         * Windows: Alter location of stdout and stderr log files to be in %APPDATA%
+           rather than beside the .exe, to enable logging even when Crystal is running
+           from a locked volume.
+     * Other fixes:
+         * Mac: Fix wxPython warning around inserting an empty list of items to a list.
+         * Fix closing the initial welcome dialog to be correctly interpreted as Quit.
+     * Documentation improvements to the README
+     * Upgrade development status from Alpha -> Beta 🎉
+
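The Windows logging change above (writing logs under %APPDATA% instead of beside the .exe) could look roughly like the following. Everything here is an assumption made for illustration: the function names, the `Crystal` folder name, the `stdout.log` and `stderr.log` file names, and the fallback to the executable's directory are not taken from Crystal's source.

```python
import os
import sys
from typing import Mapping


def log_directory(environ: Mapping[str, str], exe_dir: str,
                  app_name: str = "Crystal") -> str:
    """Pick a writable log folder: a per-app folder under APPDATA if set,
    otherwise (assumed fallback) the directory containing the executable."""
    appdata = environ.get("APPDATA")
    if appdata:
        return os.path.join(appdata, app_name)
    return exe_dir


def redirect_output_to_logs(log_dir: str) -> None:
    """Send stdout and stderr to append-mode log files inside log_dir."""
    os.makedirs(log_dir, exist_ok=True)
    # Hypothetical file names, chosen for this sketch only.
    sys.stdout = open(os.path.join(log_dir, "stdout.log"), "a", encoding="utf-8")
    sys.stderr = open(os.path.join(log_dir, "stderr.log"), "a", encoding="utf-8")
```

A frozen application might call `redirect_output_to_logs(log_directory(os.environ, os.path.dirname(sys.executable)))` early in startup, so that output is still captured when the executable's own volume is read-only.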
+ ### v1.0.0a <small>(January 24, 2012)</small>
+
+ * Initial version