UNPKG

2 kBMarkdownView Raw
1# 5.1.0 (2015-09-22)
2
3## New features
4
5- New ```contractAdjecent``` option to contract adjecent targets. See readme.
6
7- When finding nearest href, it will not only look at children but all other successors.
8
9# 5.0.0 (2015-05-21)
10
11## Breaking changes
12
13- To avoid the heavy iconv module, input support for feeds and goldwasher xml has been dropped. These were both very rare use cases and the native compilation required for iconv was both error prone and not very lightweight. The option to output to either XML or feeds is still available.
14
15- The XML output format has changed. Keywords will now be correctly put inside a ```<keywords>``` element.
16
17
18# 4.1.2 (2015-05-17)
19
20## Bugfixes
21
22-- Replace dashes (-) with spaces.
23
24# 4.1.1 (2015-05-17)
25
26## Bugfixes
27
28-- Now preserves numbers in texts.
29
30# 4.1.0 (2015-05-17)
31
32## Features
33
34- No longer uses slugification. This resulted in the removal of characters not present in the English alphabet, such as scandinavian, greek, russian etc. These are now preserved.
35
36# 4.0.0 (2015-05-16)
37
38## Features
39
40- It is now possible to both use goldwasher for scraping and conversion of its own formats. Thus, the input can now be any of the following: HTML, XML, cheerio object, array of goldwasher items, goldwasher XML or even an RSS/Atom feed. The output can be either JSON, XML, Atom or RSS. Note that feeds do not contain the same amount of information as JSON or XML.
41- The parameter "batch" has been added to the format. It contains a UUID that will be the same for all nuggets of a goldwasher batch.
42- The parameter "source" has been added to the format. It contains the original URL of the scraped page.
43
44## Breaking changes
45
46- The flags for individual goldwasher format keys have been removed. You will thus always get full goldwasher formatted objects out. If you need to remove keys from them, do so afterwards.
47- If upgrading from older versions, note that ```target``` has been renamed to the more proper ```selector``` and ```format``` to ```output```.
\No newline at end of file