hash: Support custom algo parameters #6400

weltling · 2020-11-04T13:23:19Z

The concrete need for this change is to support passing an initial
seed to the murmur hash. A custom seed matters because it lets callers
randomize the hash function.

The suggested implementation adds a HashTable parameter to all the
init callbacks. In addition, hash and hash_init accept an array of
custom arguments from userland. Several functions, such as hash_hkdf,
are currently not touched, as they don't need custom args.

Some convenience macros have been added to the SHA/MD families of
functions, so the consuming code doesn't have to be changed widely.
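
The callback change and the compatibility macros described above can be sketched roughly as follows. This is a standalone illustration, not the php-src code: `demo_args`, `demo_ctx`, `demo_init_func`, `demo_init_args` and `DEMO_INIT` are hypothetical names, and a plain struct stands in for the HashTable of custom arguments.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the HashTable of custom arguments. */
typedef struct {
    int has_seed;   /* non-zero when the caller supplied a seed */
    uint32_t seed;
} demo_args;

/* Hypothetical hash context: only algorithm state lives here. */
typedef struct {
    uint32_t h;
} demo_ctx;

/* One signature for every init callback: context plus optional args. */
typedef void (*demo_init_func)(demo_ctx *ctx, const demo_args *args);

/* Init callback that honours an optional seed; args may be NULL. */
static void demo_init_args(demo_ctx *ctx, const demo_args *args)
{
    assert(ctx != NULL);
    ctx->h = (args != NULL && args->has_seed) ? args->seed : 0;
}

/* Convenience macro: existing callers keep the one-argument form. */
#define DEMO_INIT(ctx) demo_init_args((ctx), NULL)
```

With this shape, code that never passes arguments keeps calling `DEMO_INIT(&ctx)` unchanged, while new callers go through `demo_init_args` directly.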

A special note on what has happened to
ext/hash/tests/hash_serialize_003.phpt: the test gathers
serialization strings produced on different platforms. As the
implementation requires adding a new property to the hash context
object, it changes the serialization string. However, it doesn't seem
practical to regenerate all the serialization strings. The test is
changed so that it checks that a string can be unserialized on the
same platform it was produced on at runtime.

Another way to implement this is to add a second kind of init callback
that would accept a HashTable with arguments. However, that would
still require touching all the context structs in all the algos, and
it would also increase the size of those structs. As an init function
is called just once, modifying the existing init callback seemed
preferable.

Signed-off-by: Anatol Belski <ab@php.net>

weltling · 2020-11-04T13:24:30Z

@m6w6 @nikic ^

weltling · 2020-11-16T13:00:19Z

Poke on this one.

Thanks.

weltling · 2020-11-18T12:43:43Z

@m6w6 ping :)

m6w6

While all those FooInitArgs shenanigans provide API compatibility, they do not do so for the ABI, but I think this is how we have always handled X.y releases?

I wonder, though: we could have cut the number of "provided" hash algos in half if there were an "output bits" parameter for all those Foo<NBITS> algos, couldn't we? Maybe something to deprecate and provision in the next major...

LGTM, all in all.

weltling · 2020-11-19T22:02:57Z

Thanks for the reviews, I'll be working to address this.

Regarding reducing the number of hashes: yes, it would have allowed having, say, a single SHA and passing 1/256/512/etc. Other nice things can be done with the custom args mechanism; for example, the byte order of the final hash could be reversed. And of course the necessary parts, like passing a starting seed, will now be possible.

Thanks.

weltling · 2020-11-29T12:03:54Z

I think this one is ready to be re-evaluated now.

Thanks.

ext/hash/hash.c

ext/hash/hash_murmur.c

ext/hash/tests/hash_serialize_003.phpt

kohler

This review is one-track :) It doesn't seem necessary to store the args in the hashcontext object; instead they should be used only in hash context initialization. I scanned through and see no reason to store args, even for cloning a hashcontext. There are benefits to concentrating all hash-algo-specific information in the hashcontext, and to keeping hashcontexts simpler.

nikic · 2020-12-01T14:29:16Z

> This review is one-track :) It doesn't seem necessary to store the args in the hashcontext object; instead they should be used only in hash context initialization. I scanned through and see no reason to store args, even for cloning a hashcontext. There are benefits to concentrating all hash-algo-specific information in the hashcontext, and to keeping hashcontexts simpler.

Great point! I agree that leaving storage of arguments up to the hash implementation is better. I expect that in most cases no explicit storage will be needed, as is the case for murmur, where the extra args just affect how the state is initialized.
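
A minimal sketch of this point, using a toy seeded FNV-1a variant rather than murmur: the seed is folded into the initial state and never stored, so cloning a context is a plain copy of its state. All names here (`toy_ctx`, `toy_init`, `toy_update`, `toy_final`, `toy_clone`) are hypothetical, and mixing the seed into the offset basis is an assumption made purely for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical context: running state only, no copy of the options. */
typedef struct {
    uint32_t h;
} toy_ctx;

/* The seed only changes the starting state
 * (assumption: XOR it into the FNV-1a 32-bit offset basis). */
static void toy_init(toy_ctx *ctx, uint32_t seed)
{
    ctx->h = 2166136261u ^ seed;
}

/* Standard FNV-1a step: XOR in a byte, then multiply by the prime. */
static void toy_update(toy_ctx *ctx, const unsigned char *data, size_t len)
{
    for (size_t i = 0; i < len; i++) {
        ctx->h ^= data[i];
        ctx->h *= 16777619u; /* FNV-1a 32-bit prime */
    }
}

static uint32_t toy_final(const toy_ctx *ctx)
{
    return ctx->h;
}

/* Cloning needs nothing beyond the state itself. */
static toy_ctx toy_clone(const toy_ctx *src)
{
    return *src;
}
```

A clone taken midway through the input continues to the same digest as an uninterrupted run with the same seed, even though the seed itself is no longer available anywhere in the context.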

m6w6 · 2020-12-01T15:12:12Z

🤦 of course, this makes a lot of sense!

weltling · 2020-12-01T19:21:59Z

OK, let's get it in that way. It's easier for me, as I don't have to touch that test for now. I'm still sceptical, as it's inconsistent, and once it's out in a stable release that inconsistency is set in stone. Still, it's master, so it'll become clearer when more hashes with custom args flow in.

Thanks.

nikic

This looks good to me. Only thing I would request is to rename $args -> $options in the public API, as that matches how we call such arguments elsewhere.

…`hash*()` functions

> Hash
>
> The following functions `hash()`, `hash_file()`, and `hash_init()` now support an additional optional `options` argument, which can be used to pass algorithm specific data.

Includes unit tests.

Refs:
* https://www.php.net/manual/en/migration81.new-features.php#migration81.new-features.hash
* php/php-src#6400
* php/php-src@110b4e9

kocsismate reviewed Nov 4, 2020

ext/hash/hash.stub.php (outdated, resolved)

m6w6 approved these changes Nov 19, 2020

ext/hash/hash.c (outdated, resolved)

ext/hash/hash.stub.php (outdated, resolved)

nikic reviewed Nov 19, 2020

ext/hash/hash.c (outdated, resolved)

nikic reviewed Nov 30, 2020

weltling force-pushed the hash_custom_algo_args branch from 7f4f324 to 579fd5e Compare November 30, 2020 20:21

kohler suggested changes Nov 30, 2020

ext/hash/hash.c (outdated, resolved)

ext/hash/hash.c (outdated, resolved)

ext/hash/hash.c (outdated, resolved)

ext/hash/hash.c (outdated, resolved)

ext/hash/php_hash.h (outdated, resolved)

weltling force-pushed the hash_custom_algo_args branch from 579fd5e to e62e9f6 Compare December 1, 2020 19:25

nikic approved these changes Dec 3, 2020

ext/hash/hash.stub.php (outdated, resolved)

ext/hash/hash.stub.php (outdated, resolved)

nikic reviewed Dec 3, 2020

ext/hash/hash_murmur.c (resolved)

weltling added 7 commits December 5, 2020 20:19

hash: Switch to [] by default for the args parameter

a0b85d6

Signed-off-by: Anatol Belski <ab@php.net>

hash: The args array is always mutable

e220fd9

Signed-off-by: Anatol Belski <ab@php.net>

hash: Address PR comments

f1dc8e1

- Deref passed seed arg
- Don't dup args array

Signed-off-by: Anatol Belski <ab@php.net>

hash: Revert change to ext/hash/tests/hash_serialize_003.phpt

4886816

Signed-off-by: Anatol Belski <ab@php.net>

hash: Don't carry passed initialization args through

d468b5b

Signed-off-by: Anatol Belski <ab@php.net>

hash: Remove missed remains args member

1271bff

Signed-off-by: Anatol Belski <ab@php.net>

weltling force-pushed the hash_custom_algo_args branch from de3c688 to 1271bff Compare December 5, 2020 19:20

weltling added 3 commits December 5, 2020 20:28

hash: Revert args addition to mhash variants

9c8d6ef

As discussed, that's legacy and should not be touched.

Signed-off-by: Anatol Belski <ab@php.net>

hash: Rename $args -> $options

1655a7b

Signed-off-by: Anatol Belski <ab@php.net>

hash: Fix options test

24410e0

Signed-off-by: Anatol Belski <ab@php.net>

nikic reviewed Dec 9, 2020

ext/hash/hash.c (outdated, resolved)

ext/hash/hash.c (outdated, resolved)

ext/hash/hash_murmur.c (resolved)

Update ext/hash/hash.c

bb658d1

Co-authored-by: Nikita Popov <nikita.ppv@googlemail.com>

Update ext/hash/hash.c

530c7d3

Co-authored-by: Nikita Popov <nikita.ppv@googlemail.com>

php-pulls closed this in 110b4e9 Dec 13, 2020

kocsismate mentioned this pull request Mar 16, 2021

Mysqli bind in execute #6271

Merged

jrfnl mentioned this pull request Mar 9, 2022

PHP 8.1 | NewFunctionParameters: account for PHP 8.1 changes PHPCompatibility/PHPCompatibility#1326

Merged
