Lee Thanh
Upload 3012 files
5641073
|
raw
history blame
19.1 kB
# BLAKE3
[BLAKE3](https://github.com/BLAKE3-team/BLAKE3) running in JavaScript (node.js and browsers) via native bindings, where available, or WebAssembly.
npm install blake3
Additionally, there's a flavor of the package which is identical except that it will not download native Node.js bindings:
npm install blake3-wasm
## Table of Contents
- [Quickstart](#quickstart)
- [API](#api)
- [Node.js](#nodejs)
- [`hash(data: BinaryLike, options?: { length: number }): Buffer`](#hashdata-binarylike-options--length-number--buffer)
- [`keyedHash(key: Buffer, data: BinaryLike, options?: { length: number }): Buffer`](#keyedhashkey-buffer-data-binarylike-options--length-number--buffer)
- [`deriveKey(context: string, material: BinaryLike, options?: { length: number }): Buffer`](#derivekeycontext-string-material-binarylike-options--length-number--buffer)
- [Hasher](#hasher)
- [`createHash(): Hasher`](#createhash-hasher)
- [`createKeyed(key: Buffer): Hasher`](#createkeyedkey-buffer-hasher)
- [`createDeriveKey(context: string): Hasher`](#createderivekeycontext-string-hasher)
- [`hasher.update(data: BinaryLike): this`](#hasherupdatedata-binarylike-this)
- [`hasher.digest(encoding?: string, options?: { length: number, dispose: boolean })): Buffer | string`](#hasherdigestencoding-string-options--length-number-dispose-boolean--buffer--string)
- [`hasher.reader(options?: { dispose: boolean }): HashReader`](#hasherreaderoptions--dispose-boolean--hashreader)
- [`hasher.dispose()`](#hasherdispose)
- [HashReader](#hashreader)
- [`reader.position: bigint`](#readerposition-bigint)
- [`reader.readInto(target: Buffer): void`](#readerreadintotarget-buffer-void)
- [`reader.read(bytes: number): Buffer`](#readerreadbytes-number-buffer)
- [`reader.toString([encoding]): string`](#readertostringencoding-string)
- [`reader.toBuffer(): Buffer`](#readertobuffer-buffer)
- [`reader.dispose()`](#readerdispose)
- [`using(disposable: IDisposable, fn: disposable => T): T`](#usingdisposable-idisposable-fn-disposable--t-t)
- [Browser](#browser)
- [`hash(data: BinaryLike, options?: { length: number }): Hash`](#hashdata-binarylike-options--length-number--hash)
- [`keyedHash(key: Buffer, data: BinaryLike, options?: { length: number }): Hash`](#keyedhashkey-buffer-data-binarylike-options--length-number--hash)
- [`deriveKey(context: string, material: BinaryLike, options?: { length: number }): Hash`](#derivekeycontext-string-material-binarylike-options--length-number--hash)
- [`Hash`](#hash)
- [`hash.equals(other: Uint8Array)`](#hashequalsother-uint8array)
- [`hash.toString(encoding: 'hex' | 'base64' | 'utf8'): string`](#hashtostringencoding-hex--base64--utf8-string)
- [Hasher](#hasher-1)
- [`createHash(): Hasher`](#createhash-hasher-1)
- [`createKeyed(key: Buffer): Hasher`](#createkeyedkey-buffer-hasher-1)
- [`createDeriveKey(context: string): Hasher`](#createderivekeycontext-string-hasher-1)
- [`hasher.update(data: BinaryLike): this`](#hasherupdatedata-binarylike-this-1)
- [`hasher.digest(encoding?: 'hex' | 'base64' | 'utf8', options?: { length: number, dispose: boolean })): Hash | string`](#hasherdigestencoding-hex--base64--utf8-options--length-number-dispose-boolean--hash--string)
- [`hasher.reader(options?: { dispose: boolean }): HashReader`](#hasherreaderoptions--dispose-boolean--hashreader-1)
- [`hasher.dispose()`](#hasherdispose-1)
- [HashReader](#hashreader-1)
- [`reader.position: bigint`](#readerposition-bigint-1)
- [`reader.readInto(target: Uint8Array): void`](#readerreadintotarget-uint8array-void)
- [`reader.read(bytes: number): Hash`](#readerreadbytes-number-hash)
- [`reader.toString(encoding?: string): string`](#readertostringencoding-string-string)
- [`reader.toArray(): Uint8Array`](#readertoarray-uint8array)
- [`reader.dispose()`](#readerdispose-1)
- [`using(disposable: IDisposable, fn: disposable => T): T`](#usingdisposable-idisposable-fn-disposable--t-t-1)
- [Speed](#speed)
- [Contributing](#contributing)
- [Publishing](#publishing)
## Quickstart
If you're on Node, import the module via
```js
const blake3 = require('blake3');
blake3.hash('foo'); // => Buffer
```
If you're in the browser, import `blake3/browser`. This includes a WebAssembly binary, so you probably want to import it asynchronously, like so:
```js
import('blake3/browser').then(blake3 => {
blake3.hash('foo'); // => Uint8Array
});
```
The API is very similar in Node.js and browsers, but Node supports and returns Buffers and a wider range of input and output encoding.
More complete example:
```js
const { hash, createHash } = require('blake3');
hash('some string'); // => hash a string to a uint8array
// Update incrementally (Node and Browsers):
const hash = createHash();
stream.on('data', d => hash.update(d));
stream.on('error', err => {
// hashes use unmanaged memory in WebAssembly, always free them if you don't digest()!
hash.dispose();
throw err;
});
stream.on('end', () => finishedHash(hash.digest()));
// Or, in Node, it's also a transform stream:
createReadStream('file.txt')
.pipe(createHash())
.on('data', hash => console.log(hash.toString('hex')));
```
## API
### Node.js
The Node API can be imported via `require('blake3')`.
#### `hash(data: BinaryLike, options?: { length: number }): Buffer`
Returns a hash for the given data. The data can be a string, buffer, typedarray, array buffer, or array. By default, it generates the first 32 bytes of the hash for the data, but this is configurable. It returns a Buffer.
#### `keyedHash(key: Buffer, data: BinaryLike, options?: { length: number }): Buffer`
Returns keyed a hash for the given data. The key must be exactly 32 bytes. The data can be a string, buffer, typedarray, array buffer, or array. By default, it generates the first 32 bytes of the hash for the data, but this is configurable. It returns a Buffer.
For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.keyed_hash.html).
#### `deriveKey(context: string, material: BinaryLike, options?: { length: number }): Buffer`
The key derivation function. The data can be a string, buffer, typedarray, array buffer, or array. By default, it generates the first 32 bytes of the hash for the data, but this is configurable. It returns a Buffer.
For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.derive_key.html).
#### Hasher
The hasher is a type that lets you incrementally build a hash. It's compatible with Node's crypto hash instance. For instance, it implements a transform stream, so you could do something like:
```js
createReadStream('file.txt')
.pipe(createHash())
.on('data', hash => console.log(hash.toString('hex')));
```
##### `createHash(): Hasher`
Creates a new hasher instance using the standard hash function.
##### `createKeyed(key: Buffer): Hasher`
Creates a new hasher instance for a keyed hash. For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.keyed_hash.html).
##### `createDeriveKey(context: string): Hasher`
Creates a new hasher instance for the key derivation function. For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.derive_key.html).
##### `hasher.update(data: BinaryLike): this`
Adds data to a hash. The data can be a string, buffer, typedarray, array buffer, or array. This will throw if called after `digest()` or `dispose()`.
##### `hasher.digest(encoding?: string, options?: { length: number, dispose: boolean })): Buffer | string`
Returns the hash of the data. If an `encoding` is given, a string will be returned. Otherwise, a Buffer is returned. Optionally, you can specify the requested byte length of the hash.
If `dispose: false` is given in the options, the hash will not automatically be disposed of, allowing you to continue updating it after obtaining the current reader.
##### `hasher.reader(options?: { dispose: boolean }): HashReader`
Returns a [HashReader](#HashReader) for the current hash.
If `dispose: false` is given in the options, the hash will not automatically be disposed of, allowing you to continue updating it after obtaining the current reader.
##### `hasher.dispose()`
Disposes of unmanaged resources. You should _always_ call this if you don't call `digest()` to free umanaged (WebAssembly-based) memory.
#### HashReader
The hash reader can be returned from hashing functions. Up to 2<sup>64</sup>-1 bytes of data can be read from BLAKE3 hashes; this structure lets you read those. Note that, like `hash`, this is an object which needs to be manually disposed of.
##### `reader.position: bigint`
A property which gets or sets the position of the reader in the output stream. A `RangeError` is thrown if setting this to a value less than 0 or greater than 2<sup>64</sup>-1. Note that this is a bigint, not a standard number.
```js
reader.position += 32n; // advance the reader 32 bytes
```
##### `reader.readInto(target: Buffer): void`
Reads bytes into the target array, filling it up and advancing the reader's position. A `RangeError` is thrown if reading this data puts the reader past 2<sup>64</sup>-1 bytes.
##### `reader.read(bytes: number): Buffer`
Reads and returns the given number of bytes from the reader, and advances the position. A `RangeError` is thrown if reading this data puts the reader past 2<sup>64</sup>-1 bytes.
##### `reader.toString([encoding]): string`
Converts first 32 bytes of the hash to a string with the given encoding. Defaults to hex encoding.
##### `reader.toBuffer(): Buffer`
Converts first 32 bytes of the hash to a Buffer.
##### `reader.dispose()`
Disposes of unmanaged resources. You should _always_ call this to free umanaged (WebAssembly-based) memory, or you application will leak memory.
#### `using(disposable: IDisposable, fn: disposable => T): T`
A helper method that takes a disposable, and automatically calls the dispose method when the function returns, or the promise returned from the function is settled.
```js
// read and auto-dispose the first 64 bytes
const first64Bytes = using(hash.reader(), reader => reader.toBuffer(64));
// you can also return promises/use async methods:
using(hash.reader(), async reader => {
do {
await send(reader.read(64));
} while (needsMoreData());
});
```
### Browser
The browser API can be imported via `import('blake3/browser')`, which works well with Webpack.
If you aren't using a bundler or using a more "pure" bundler like Parcel, you can import `blake3/browser-async` which exports a function to asynchronously load the WebAssembly code and resolves to the package contents.
```js
import load from 'blake3/browser-async';
load().then(blake3 => {
console.log(blake3.hash('hello world'));
});
```
#### `hash(data: BinaryLike, options?: { length: number }): Hash`
Returns a hash for the given data. The data can be a string, typedarray, array buffer, or array. By default, it generates the first 32 bytes of the hash for the data, but this is configurable. It returns a [Hash](#Hash) instance.
#### `keyedHash(key: Buffer, data: BinaryLike, options?: { length: number }): Hash`
Returns keyed a hash for the given data. The key must be exactly 32 bytes. The data can be a string, typedarray, array buffer, or array. By default, it generates the first 32 bytes of the hash for the data, but this is configurable. It returns a [Hash](#Hash) instance.
For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.keyed_hash.html).
#### `deriveKey(context: string, material: BinaryLike, options?: { length: number }): Hash`
The key derivation function. The data can be a string, typedarray, array buffer, or array. By default, it generates the first 32 bytes of the hash for the data, but this is configurable. It returns a [Hash](#Hash) instance.
For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.derive_key.html).
#### `Hash`
A Hash is the type returned from hash functions and the hasher in the browser. It's a `Uint8Array` with a few additional helper methods.
##### `hash.equals(other: Uint8Array)`
Returns whether this hash equals the other hash, via a constant-time equality check.
##### `hash.toString(encoding: 'hex' | 'base64' | 'utf8'): string`
#### Hasher
The hasher is a type that lets you incrementally build a hash. For instance, you can hash a `fetch`ed page like:
```js
const res = await fetch('https://example.com');
const body = await res.body;
const hasher = blake3.createHash();
const reader = body.getReader();
while (true) {
const { done, value } = await reader.read();
if (done) {
break;
}
hasher.update(value);
}
console.log('Hash of', res.url, 'is', hasher.digest('hex'));
```
Converts the hash to a string with the given encoding.
##### `createHash(): Hasher`
Creates a new hasher instance using the standard hash function.
##### `createKeyed(key: Buffer): Hasher`
Creates a new hasher instance for a keyed hash. For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.keyed_hash.html).
##### `createDeriveKey(context: string): Hasher`
Creates a new hasher instance for the key derivation function. For more information, see [the blake3 docs](https://docs.rs/blake3/0.1.3/blake3/fn.derive_key.html).
##### `hasher.update(data: BinaryLike): this`
Adds data to a hash. The data can be a string, buffer, typedarray, array buffer, or array. This will throw if called after `digest()` or `dispose()`.
##### `hasher.digest(encoding?: 'hex' | 'base64' | 'utf8', options?: { length: number, dispose: boolean })): Hash | string`
Returns the hash of the data. If an `encoding` is given, a string will be returned. Otherwise, a [Hash](#hash) is returned. Optionally, you can specify the requested byte length of the hash.
If `dispose: false` is given in the options, the hash will not automatically be disposed of, allowing you to continue updating it after obtaining the current reader.
##### `hasher.reader(options?: { dispose: boolean }): HashReader`
Returns a [HashReader](#HashReader) for the current hash.
If `dispose: false` is given in the options, the hash will not automatically be disposed of, allowing you to continue updating it after obtaining the current reader.
##### `hasher.dispose()`
Disposes of unmanaged resources. You should _always_ call this if you don't call `digest()` to free umanaged (WebAssembly-based) memory.
#### HashReader
The hash reader can be returned from hashing functions. Up to 2<sup>64</sup>-1 bytes of data can be read from BLAKE3 hashes; this structure lets you read those. Note that, like `hash`, this is an object which needs to be manually disposed of.
##### `reader.position: bigint`
A property which gets or sets the position of the reader in the output stream. A `RangeError` is thrown if setting this to a value less than 0 or greater than 2<sup>64</sup>-1. Note that this is a bigint, not a standard number.
```js
reader.position += 32n; // advance the reader 32 bytes
```
##### `reader.readInto(target: Uint8Array): void`
Reads bytes into the target array, filling it up and advancing the reader's position. A `RangeError` is thrown if reading this data puts the reader past 2<sup>64</sup>-1 bytes.
##### `reader.read(bytes: number): Hash`
Reads and returns the given number of bytes from the reader, and advances the position. A `RangeError` is thrown if reading this data puts the reader past 2<sup>64</sup>-1 bytes.
##### `reader.toString(encoding?: string): string`
Converts first 32 bytes of the hash to a string with the given encoding. Defaults to hex encoding.
##### `reader.toArray(): Uint8Array`
Converts first 32 bytes of the hash to an array.
##### `reader.dispose()`
Disposes of unmanaged resources. You should _always_ call this to free umanaged (WebAssembly-based) memory, or you application will leak memory.
#### `using(disposable: IDisposable, fn: disposable => T): T`
A helper method that takes a disposable, and automatically calls the dispose method when the function returns, or the promise returned from the function is settled.
```js
// read and auto-dispose the first 64 bytes
const first64Bytes = using(hash.reader(), reader => reader.toArray(64));
// you can also return promises/use async methods:
using(hash.reader(), async reader => {
do {
await send(reader.read(64));
} while (needsMoreData());
});
```
## Speed
> Native Node.js bindings are a work in progress.
You can run benchmarks by installing `npm install -g @c4312/matcha`, then running `matcha benchmark.js`. These are the results running on Node 12 on my MacBook. Blake3 is significantly faster than Node's built-in hashing.
276,000 ops/sec > 64B#md5 (4,240x)
263,000 ops/sec > 64B#sha1 (4,040x)
271,000 ops/sec > 64B#sha256 (4,160x)
1,040,000 ops/sec > 64B#blake3 wasm (15,900x)
625,000 ops/sec > 64B#blake3 native (9,590x)
9,900 ops/sec > 64KB#md5 (152x)
13,900 ops/sec > 64KB#sha1 (214x)
6,470 ops/sec > 64KB#sha256 (99.2x)
6,410 ops/sec > 64KB#blake3 wasm (98.4x)
48,900 ops/sec > 64KB#blake3 native (750x)
106 ops/sec > 6MB#md5 (1.63x)
150 ops/sec > 6MB#sha1 (2.3x)
69.2 ops/sec > 6MB#sha256 (1.06x)
65.2 ops/sec > 6MB#blake3 wasm (1x)
502 ops/sec > 6MB#blake3 native (7.7x)
## Contributing
This build is a little esoteric due to the mixing of languages. We use a `Makefile` to coodinate things.
To get set up, you'll want to open the repository in VS Code. Make sure you have [Remote Containers](https://marketplace.visualstudio.com/items?itemName=ms-vscode-remote.remote-containers) installed, and then accept the "Reopen in Container" prompt when opening the folder. This will get the environment set up with everything you need. Then, run `make prepare` to install local dependencies.
Finally, `make` will create a build for you; you can run `make MODE=release` for a production release, and certainly should if you want to [benchmark it](#speed).
- Rust code is compiled from `src/lib.rs` to `pkg/browser` and `pkg/node`
- TypeScript code is compiled from `ts/*.ts` into `dist`
### Publishing
In case I get hit by a bus or get other contributors, these are the steps for publishing:
1. Get all your code ready to go in master, pushed up to Github.
2. Run `make prepare-binaries`. This will update the branch `generate-binary`, which kicks off a build via Github actions to create `.node` binaries for every relevant Node.js version.
3. When the build completes, it'll generate a zip file of artifacts. Download those.
4. Back on master, run `npm version <type>` to update the version in git. `git push --tags`.
5. On Github, upload the contents of the artifacts folder to the release for the newly tagged version.
6. Run `npm publish`.