UNPKG

12.3 kBMarkdownView Raw
1# jest-worker
2
3Module for executing heavy tasks under forked processes in parallel, by providing a `Promise` based interface, minimum overhead, and bound workers.
4
5The module works by providing an absolute path of the module to be loaded in all forked processes. All methods are exposed on the parent process as promises, so they can be `await`'ed. Child (worker) methods can either be synchronous or asynchronous.
6
7The module also implements support for bound workers. Binding a worker means that, based on certain parameters, the same task will always be executed by the same worker. The way bound workers work is by using the returned string of the `computeWorkerKey` method. If the string was used before for a task, the call will be queued to the related worker that processed the task earlier; if not, it will be executed by the first available worker, then sticked to the worker that executed it; so the next time it will be processed by the same worker. If you have no preference on the worker executing the task, but you have defined a `computeWorkerKey` method because you want _some_ of the tasks to be sticked, you can return `null` from it.
8
9The list of exposed methods can be explicitly provided via the `exposedMethods` option. If it is not provided, it will be obtained by requiring the child module into the main process, and analyzed via reflection. Check the "minimal example" section for a valid one.
10
11## Install
12
13```sh
14yarn add jest-worker
15```
16
17## Example
18
19This example covers the minimal usage:
20
21### File `parent.js`
22
23```js
24import {Worker as JestWorker} from 'jest-worker';
25
26async function main() {
27 const worker = new JestWorker(require.resolve('./worker'));
28 const result = await worker.hello('Alice'); // "Hello, Alice"
29}
30
31main();
32```
33
34### File `worker.js`
35
36```js
37export function hello(param) {
38 return `Hello, ${param}`;
39}
40```
41
42## Experimental worker
43
44Node shipped with [`worker_threads`](https://nodejs.org/api/worker_threads.html), a "threading API" that uses `SharedArrayBuffers` to communicate between the main process and its child threads. This feature can significantly improve the communication time between parent and child processes in `jest-worker`.
45
46To use `worker_threads` instead of default `child_process` you have to pass `enableWorkerThreads: true` when instantiating the worker.
47
48## API
49
50The `Worker` export is a constructor that is initialized by passing the worker path, plus an options object.
51
52### `workerPath: string` (required)
53
54Node module name or absolute path of the file to be loaded in the child processes. Use `require.resolve` to transform a relative path into an absolute one.
55
56### `options: Object` (optional)
57
58#### `computeWorkerKey: (method: string, ...args: Array<unknown>) => string | null` (optional)
59
60Every time a method exposed via the API is called, `computeWorkerKey` is also called in order to bound the call to a worker. This is useful for workers that are able to cache the result or part of it. You bound calls to a worker by making `computeWorkerKey` return the same identifier for all different calls. If you do not want to bind the call to any worker, return `null`.
61
62The callback you provide is called with the method name, plus all the rest of the arguments of the call. Thus, you have full control to decide what to return. Check a practical example on bound workers under the "bound worker usage" section.
63
64By default, no process is bound to any worker.
65
66#### `enableWorkerThreads: boolean` (optional)
67
68By default, `jest-worker` will use `child_process` threads to spawn new Node.js processes. If you prefer [`worker_threads`](https://nodejs.org/api/worker_threads.html) instead, pass `enableWorkerThreads: true`.
69
70#### `exposedMethods: ReadonlyArray<string>` (optional)
71
72List of method names that can be called on the child processes from the parent process. You cannot expose any method named like a public `Worker` method, or starting with `_`. If you use method auto-discovery, then these methods will not be exposed, even if they exist.
73
74#### `forkOptions: ForkOptions` (optional)
75
76Allow customizing all options passed to `child_process.fork`. By default, some values are set (`cwd`, `env`, `execArgv` and `serialization`), but you can override them and customize the rest. For a list of valid values, check [the Node documentation](https://nodejs.org/api/child_process.html#child_process_child_process_fork_modulepath_args_options).
77
78#### `idleMemoryLimit: number` (optional)
79
80Specifies the memory limit for workers before they are recycled and is primarily a work-around for [this issue](https://github.com/facebook/jest/issues/11956);
81
82After the worker has executed a task the memory usage of it is checked. If it exceeds the value specified the worker is killed and restarted. If no limit is set this process does not occur. The limit can be specified in 2 ways:
83
84- `<= 1` - The value is assumed to be a percentage of system memory. So 0.5 sets the memory limit of the worker to half of the total system memory
85- `\> 1` - Assumed to be a fixed byte value. Because of the previous rule if you wanted a value of 1 byte (I don't know why) you could use `1.1`.
86
87#### `maxRetries: number` (optional)
88
89Maximum amount of times that a dead child can be re-spawned, per call. Defaults to `3`, pass `Infinity` to allow endless retries.
90
91#### `numWorkers: number` (optional)
92
93Amount of workers to spawn. Defaults to the number of CPUs minus 1.
94
95#### `resourceLimits: ResourceLimits` (optional)
96
97The `resourceLimits` option which will be passed to `worker_threads` workers.
98
99#### `silent: Boolean` (optional)
100
101Set to false for `stdout` and `stderr` to be logged to console.
102
103By default this is true.
104
105#### `setupArgs: Array<unknown>` (optional)
106
107The arguments that will be passed to the `setup` method during initialization.
108
109#### `taskQueue: TaskQueue` (optional)
110
111The task queue defines in which order tasks (method calls) are processed by the workers. `jest-worker` ships with a `FifoQueue` and `PriorityQueue`:
112
113- `FifoQueue` (default): Processes the method calls (tasks) in the call order.
114- `PriorityQueue`: Processes the method calls by a computed priority in natural ordering (lower priorities first). Tasks with the same priority are processed in any order (FIFO not guaranteed). The constructor accepts a single argument, the function that is passed the name of the called function and the arguments and returns a numerical value for the priority: `new require('jest-worker').PriorityQueue((method, filename) => filename.length)`.
115
116#### `WorkerPool: new (workerPath: string, options?: WorkerPoolOptions) => WorkerPoolInterface` (optional)
117
118Provide a custom WorkerPool class to be used for spawning child processes.
119
120#### `workerSchedulingPolicy: 'round-robin' | 'in-order'` (optional)
121
122Specifies the policy how tasks are assigned to workers if multiple workers are _idle_:
123
124- `round-robin` (default): The task will be sequentially distributed onto the workers. The first task is assigned to the worker 1, the second to the worker 2, to ensure that the work is distributed across workers.
125- `in-order`: The task will be assigned to the first free worker starting with worker 1 and only assign the work to worker 2 if the worker 1 is busy.
126
127Tasks are always assigned to the first free worker as soon as tasks start to queue up. The scheduling policy does not define the task scheduling which is always first-in, first-out.
128
129## JestWorker
130
131### Methods
132
133The returned `JestWorker` instance has all the exposed methods, plus some additional ones to interact with the workers itself:
134
135#### `getStdout(): Readable`
136
137Returns a `ReadableStream` where the standard output of all workers is piped. Note that the `silent` option of the child workers must be set to `true` to make it work. This is the default set by `jest-worker`, but keep it in mind when overriding options through `forkOptions`.
138
139#### `getStderr(): Readable`
140
141Returns a `ReadableStream` where the standard error of all workers is piped. Note that the `silent` option of the child workers must be set to `true` to make it work. This is the default set by `jest-worker`, but keep it in mind when overriding options through `forkOptions`.
142
143#### `end()`
144
145Finishes the workers by killing all workers. No further calls can be done to the `Worker` instance.
146
147Returns a Promise that resolves with `{ forceExited: boolean }` once all workers are dead. If `forceExited` is `true`, at least one of the workers did not exit gracefully, which likely happened because it executed a leaky task that left handles open. This should be avoided, force exiting workers is a last resort to prevent creating lots of orphans.
148
149**Note:**
150
151`await`ing the `end()` Promise immediately after the workers are no longer needed before proceeding to do other useful things in your program may not be a good idea. If workers have to be force exited, `jest-worker` may go through multiple stages of force exiting (e.g. SIGTERM, later SIGKILL) and give the worker overall around 1 second time to exit on its own. During this time, your program will wait, even though it may not be necessary that all workers are dead before continuing execution.
152
153Consider deliberately leaving this Promise floating (unhandled resolution). After your program has done the rest of its work and is about to exit, the Node process will wait for the Promise to resolve after all workers are dead as the last event loop task. That way you parallelized computation time of your program and waiting time and you didn't delay the outputs of your program unnecessarily.
154
155### Worker IDs
156
157Each worker has a unique id (index that starts with `'1'`), which is available inside the worker as `process.env.JEST_WORKER_ID`.
158
159## Setting up and tearing down the child process
160
161The child process can define two special methods (both of them can be asynchronous):
162
163- `setup()`: If defined, it's executed before the first call to any method in the child.
164- `teardown()`: If defined, it's executed when the farm ends.
165
166# More examples
167
168## Standard usage
169
170This example covers the standard usage:
171
172### File `parent.js`
173
174```js
175import {Worker as JestWorker} from 'jest-worker';
176
177async function main() {
178 const myWorker = new JestWorker(require.resolve('./worker'), {
179 exposedMethods: ['foo', 'bar', 'getWorkerId'],
180 numWorkers: 4,
181 });
182
183 console.log(await myWorker.foo('Alice')); // "Hello from foo: Alice"
184 console.log(await myWorker.bar('Bob')); // "Hello from bar: Bob"
185 console.log(await myWorker.getWorkerId()); // "3" -> this message has sent from the 3rd worker
186
187 const {forceExited} = await myWorker.end();
188 if (forceExited) {
189 console.error('Workers failed to exit gracefully');
190 }
191}
192
193main();
194```
195
196### File `worker.js`
197
198```js
199export function foo(param) {
200 return `Hello from foo: ${param}`;
201}
202
203export function bar(param) {
204 return `Hello from bar: ${param}`;
205}
206
207export function getWorkerId() {
208 return process.env.JEST_WORKER_ID;
209}
210```
211
212## Bound worker usage:
213
214This example covers the usage with a `computeWorkerKey` method:
215
216### File `parent.js`
217
218```js
219import {Worker as JestWorker} from 'jest-worker';
220
221async function main() {
222 const myWorker = new JestWorker(require.resolve('./worker'), {
223 computeWorkerKey: (method, filename) => filename,
224 });
225
226 // Transform the given file, within the first available worker.
227 console.log(await myWorker.transform('/tmp/foo.js'));
228
229 // Wait a bit.
230 await sleep(10000);
231
232 // Transform the same file again. Will immediately return because the
233 // transformed file is cached in the worker, and `computeWorkerKey` ensures
234 // the same worker that processed the file the first time will process it now.
235 console.log(await myWorker.transform('/tmp/foo.js'));
236
237 const {forceExited} = await myWorker.end();
238 if (forceExited) {
239 console.error('Workers failed to exit gracefully');
240 }
241}
242
243main();
244```
245
246### File `worker.js`
247
248```js
249import babel from '@babel/core';
250
251const cache = Object.create(null);
252
253export function transform(filename) {
254 if (cache[filename]) {
255 return cache[filename];
256 }
257
258 // jest-worker can handle both immediate results and thenables. If a
259 // thenable is returned, it will be await'ed until it resolves.
260 return babel.transformFileAsync(filename).then(result => {
261 cache[filename] = result;
262
263 return result;
264 });
265}
266```