jiffy

Graphique des révisions

Auteur	SHA1	Message	Date
Paul J. Davis	f74b04fd11	Fix bug in bytes_per_red option Turns out this was never implemented correctly since it was referring to the original bytes_per_iter atom.	il y a 4 ans
Paul J. Davis	0ba322e421	Fix binary leak when encoding large strings This bug was due to an interaction between two optimizations. If we attempt to flush the buffer before any bytes are used we refused. However, in enc_ensure we were not checking whether the buffer was actually flushed so we would allocate a new buffer for the request. The easiest way to encounter this issue was by encoding a raw binary longer than 2041 bytes (i.e., `jiffy:encode(<<"stuff...">>).`).	il y a 5 ans
Joan Touzet	265be337f8	Force Windows to export inlined functions	il y a 5 ans
Paul J. Davis	4a3785eb70	Use from_array instead of positional versions	il y a 6 ans
Paul J. Davis	591448da47	Fix backwards compatibility	il y a 6 ans
Paul J. Davis	56b25bb6b7	Fix R14 support	il y a 6 ans
Paul J. Davis	dd831c4f2c	Fix support of VMs pre-17.x	il y a 6 ans
Paul J. Davis	8813ee3eec	Upgrade to double-converson 3.1.4	il y a 6 ans
John Högberg	48ff666091	Remove -fno-strict-aliasing We don't do fishy things like type punning so it really isn't necessary, and supplying it prevents certain optimizations.	il y a 6 ans
John Högberg	a3b03d5aec	Get rid of separate unsigned/signed buffers	il y a 6 ans
John Högberg	a1196ba754	Never expand the encode buffer; emit and restart This greatly increases the performance of long string encodes as we won't need to copy intermediate results over and over.	il y a 6 ans
John Högberg	0a72ec40ba	Fix erroneous state check in the decoder If the input contained a mismatched end-of-array/object, the stack could become empty before a call to dec_curr, which would look beyond the bounds of the stack. If the value at this invalid position happened to be st_array, we would pop too much from the stack and overwrite the data that came before it. This commit fixes this by letting dec_pop return the previous state or st_invalid if the stack is empty, letting us exit gracefully if the state isn't what we expect it to be. dec_pop_assert is identical to the old dec_pop, tearing down the emulator on internal errors.	il y a 6 ans
John Högberg	ac950d657b	Move all atom checks under enif_is_atom	il y a 6 ans
John Högberg	6bc427e405	Use the result map for key dedupe This is a wee bit more cache-friendly than using a std::set with string keys.	il y a 6 ans
John Högberg	58c49522da	Refactor trapping and trap more often during decode	il y a 6 ans
John Högberg	706527d100	Skip erroneous UTF-8 validation for atoms We requested atoms in latin1 and then handled them as utf-8, erroring out on some valid atoms and performing pointless validation on others.	il y a 6 ans
John Högberg	182393ff6a	Use enif_is_identical for equality checks	il y a 6 ans
John Högberg	596a61ebfc	Replace sprintf with a dedicated integer print routine	il y a 6 ans
John Högberg	ae712b58c5	Skip redundant enif_is_empty_list checks during encode enif_get_list_cell fails when the list is empty.	il y a 6 ans
John Högberg	798e4a7dd2	Use realloc instead of doing it manually	il y a 6 ans
John Högberg	e274565cf1	Walk through strings once when encoding	il y a 6 ans
John Högberg	4ab9a68313	Use an array for the position stack rather than an Erlang list	il y a 6 ans
John Högberg	4ad1eb3a6d	sizeof(char) == 1 by definition	il y a 6 ans
Jihyun Yu	051a74338c	fix bug on hex escape table number of items on `hexvals` is 128 while table size is 256, so remaining 128 items are filled with zero. As a result, values in \xf0-\xff will be treated as zero while should be rejected.	il y a 6 ans
David Hull	5c29452a1e	Fix decoding of "\uDBFF\uDFFF" surrogate pair.	il y a 6 ans
Lynn Gabbay	323568e9d5	fixed issue 162 regarding duplicate keys in objects	il y a 7 ans
Paul J. Davis	dddb392f88	Add `copy_strings` feature Some users of Jiffy have experienced issues when decoding large JSON documents. Normally Jiffy expects smallish documents and returns any strings as sub-binaries. When dealing with large documents these sub-binary references can keep a large amount of RAM around unless the user goes through and applies `binary:copy/1` on every string returned from Jiffy. This however causes a large amount of CPU usage to do something that Jiffy could do as it builds the JSON structure. The `copy_strings` decoder option does exactly this. Instead of returning sub-binaries Jiffy now copies every string into a newly allocated binary. Users report that this fixes the memory issues while also not negatively affecting performance significantly.	il y a 7 ans
Paul J. Davis	128811a7cf	Add `dedupe_keys` option You can no optionally request that keys are deduplicate inside of Jiffy instead of having to perform that operation in Erlang.	il y a 7 ans
David Hull	df791ef638	Tighten string buffer size calculation in enc_string. When "\u"-escaping a Unicode character, the esc_extra value doesn't need to include the number of bytes in the input string. That is, if a three-byte UTF-8 character is being escaped to a six-byte "\uXXXX" sequence, esc_extra only needs to be increased by 3.	il y a 7 ans
Paul J. Davis	4c0bfbc0fa	Fix enc_long for 64-bit Windows Originally reported by @NorthNick on apache/couchdb-jiffy.	il y a 8 ans
Paul J. Davis	e43ea64ae0	Remove old debug printing	il y a 9 ans
Jon Parise	fa825b6fd6	Destroy map iterators once we're done with them. Each call to enif_map_iterator_create() must be paired with a call to enif_map_iterator_destroy(). Otherwise, we'll leak memory. Fixes #112	il y a 9 ans
Paul J. Davis	e008c0c3ff	Fix compiler warning on gcc 5.1.0	il y a 9 ans
Paul J. Davis	454928ff34	Revamp yields back to Erlang In the original PR for `return_trailer` @vlm pointed out that I wasn't using enif_consume_timeslice correctly. This fixes that by changing out its called. Previously we attempted to define the total number of bytes to decode or encode in a single NIF call and then would consume as much of the timeslice as we processed. This is wrong because we may start the NIF call with less than an entire timeslice left. The new approach is to define the number of bytes to encode or decode per reduction and then iteratively call enif_consume_timeslice until it indicates that we should return.	il y a 9 ans
Paul J. Davis	6d2278e906	Add new return_trailer option Previously Jiffy would throw an error about trailing data if there is any non-whitespace character encounter after the first term had been decoded. This patch adds a decoder option `return_trailer` that will instead return a sub-binary starting at the first non-whitespace character. This allows users to be able to decode multiple terms from a single iodata() term. Thanks to @vlm for the original patch.	il y a 9 ans
Paul J. Davis	5d6b2651c9	Update double-conversion to latest master	il y a 9 ans
Jeremie Lasalle Ratelle	1784ca2ab5	Add an option to escape forward slashes This brings back escaping forward slashes as an option during encoding. Default is still not to escape.	il y a 10 ans
Jeremie Lasalle Ratelle	cd1b263208	Add an option to control null value decoding atom This is pretty much a generalisation of the use_nil option to support an arbitrary atom.	il y a 10 ans
Emil Falk	587f143e27	Changed pos to unsigned int to prevent warning from happening.	il y a 10 ans
Paul J. Davis	137d3d94b6	Account for char possibly being unsigned This sounds rather insane to me but I've managed to show that `(char) -1` is converted to 255 on some platforms. This was reproduced on ppc64el via Qemu on OS X. A simple program that does `fprintf(stderr, "%d\r\n", (char) -1);` prints 255 to the console. Rather than rely on the signedness of a char I've just updated things to use an unsigned char (which hopefully is never signed) and replaced -1 with 255 for the sentinel value when converting hex values. Thanks to Balint Reczey (@rbalint) for the report. Fixes #74	il y a 10 ans
Paul J. Davis	6318efa798	Fix memory leak when encoding bare bignums This fixes a leak when encoding a bare bignum. Technically it would be possible to hit this memory leak randomly with bignums in objects but the chances are highly unlikely. Thanks to @miriampena for the issue. Fixes #69	il y a 10 ans
Paul J. Davis	5cd89c1eda	Persist the `val` register across yield The `val` variable is a register value that we need to be able to return at any time from `decode_iter`. If it happened that a yield was triggered while processing trailing whitespace the lack of persistance caused decode to return a term intialized from a random integer value. Obviously the Erlang VM did not enjoy this. Thanks to @michalpalka for the report. Fixes #66	il y a 10 ans
Paul J. Davis	f9095c5258	Improved encoder errors This updates encoder errors to report the actual Erlang value that caused the error. This should make it easier to debug errors when generating JSON.	il y a 10 ans
Paul J. Davis	5eb499d73e	Tweak the nil encoding logic I must've managed to miss the PR update from Stanislav the other day when merging this.	il y a 10 ans
Stanislav Vishnevskiy	2dbf89f51c	Improved Elixir compatibility This implements the `use_nil` option as discussed on issue #64. Passing the atom `use_nil` as an option to both encode and decode will replace the atom `null` with `nil` when decoding and encode `nil` as `null` when encoding values. Fixes #64 Fixes #68	il y a 10 ans
Paul J. Davis	99867af6e9	Avoid uint64 for 32bit compatibility Rather than worry about truncation casting from a possibly 64bit value down to a possibly 32bit size_t we just limit the total bytes per invocation to 4G using an unsigned integer. Thanks to @seriyps for the report. Fixes #61	il y a 11 ans
Paul J. Davis	b96de951a2	Initial support for the new map type This patch adds initial support for decoding/encoding to/from the new maps data type. I'd like to thank Jihyun Yu (yjh0502) for the initial versions of this work.	il y a 11 ans
Paul J. Davis	bda503527d	Yield back to Erlang while encoding JSON This adds a configurable limit on the number of bytes produced by the encoder before yielding back to the Erlang VM. This is to avoid the infamous scheduler collapse issues. The `jiffy:encode/2` now takes an option `{bytes_per_iter, pos_integer()}` that controls the yield frequency. The default value is 2048.	il y a 11 ans
Paul J. Davis	e9a102af7d	Yield back to Erlang while decoding JSON This adds a configurable limit on the number of bytes consumed by the decoder before yielding back to the Erlang VM. This is to avoid the infamous scheduler collapse issues. The `jiffy:decode/2` now takes an option `{bytes_per_iter, pos_integer()}` that controls the yield frequency. The default value is 2048.	il y a 11 ans
Paul J. Davis	5ccff57ade	Use a resource for the encoder structure This is ground work to allow Jiffy to yield back to the scheduler. Creating an encoder resource will allow for the necessary state to be carried across NIF function invocations.	il y a 11 ans

1 2

86 Révisions (f74b04fd11a0a21544c90941ae1e239907e4fe67)