UNPKG

1.79 kBMarkdownView Raw
1# 5.0.0 (2015-21-05)
2
3## Breaking changes
4
5- To avoid the heavy iconv module, input support for feeds and goldwasher xml has been dropped. These were both very rare use cases and the native compilation required for iconv was both error prone and not very lightweight. The option to output to either XML or feeds is still available.
6
7- The XML output format has changed. Keywords will now be correctly put inside a ```<keywords>``` element.
8
9
10# 4.1.2 (2015-17-05)
11
12## Bugfixes
13
14-- Replace dashes (-) with spaces.
15
16# 4.1.1 (2015-17-05)
17
18## Bugfixes
19
20-- Now preserves numbers in texts.
21
22# 4.1.0 (2015-17-05)
23
24## Features
25
26- No longer uses slugification. This resulted in the removal of characters not present in the English alphabet, such as scandinavian, greek, russian etc. These are now preserved.
27
28# 4.0.0 (2015-16-05)
29
30## Features
31
32- It is now possible to both use goldwasher for scraping and conversion of its own formats. Thus, the input can now be any of the following: HTML, XML, cheerio object, array of goldwasher items, goldwasher XML or even an RSS/Atom feed. The output can be either JSON, XML, Atom or RSS. Note that feeds do not contain the same amount of information as JSON or XML.
33- The parameter "batch" has been added to the format. It contains a UUID that will be the same for all nuggets of a goldwasher batch.
34- The parameter "source" has been added to the format. It contains the original URL of the scraped page.
35
36## Breaking changes
37
38- The flags for individual goldwasher format keys have been removed. You will thus always get full goldwasher formatted objects out. If you need to remove keys from them, do so afterwards.
39- If upgrading from older versions, note that ```target``` has been renamed to the more proper ```selector``` and ```format``` to ```output```.
\No newline at end of file