redis

Commit Graph

Author	SHA1	Message	Date
antirez	c6db0a7c20	Don't use cross-thread unlocking.	2025-04-02 10:12:03 +02:00
YaacovHazan	c6e5d1d5fe	Merge remote-tracking branch 'upstream/unstable' into HEAD CI / test-sanitizer-address (push) Failing after 32s Details CI / test-ubuntu-latest (push) Failing after 33s Details CI / build-debian-old (push) Failing after 32s Details CI / build-libc-malloc (push) Failing after 32s Details CI / build-centos-jemalloc (push) Failing after 32s Details CI / build-old-chain-jemalloc (push) Failing after 32s Details External Server Tests / test-external-standalone (push) Failing after 32s Details Codecov / code-coverage (push) Failing after 33s Details External Server Tests / test-external-cluster (push) Failing after 31s Details External Server Tests / test-external-nodebug (push) Failing after 31s Details Spellcheck / Spellcheck (push) Failing after 31s Details CI / build-32bit (push) Failing after 31m25s Details CI / build-macos-latest (push) Has been cancelled Details	2025-03-31 21:26:40 +03:00
DvirDukhan	8ea8f4220c	Update RediSearch Makefile - 7.99.90 (#13905 ) CI / test-ubuntu-latest (push) Failing after 32s Details CI / build-debian-old (push) Failing after 31s Details CI / test-sanitizer-address (push) Failing after 31s Details CI / build-32bit (push) Failing after 32s Details CI / build-libc-malloc (push) Failing after 32s Details CI / build-centos-jemalloc (push) Failing after 31s Details CI / build-old-chain-jemalloc (push) Failing after 32s Details Codecov / code-coverage (push) Failing after 31s Details Spellcheck / Spellcheck (push) Failing after 32s Details External Server Tests / test-external-standalone (push) Failing after 31s Details External Server Tests / test-external-cluster (push) Failing after 31s Details External Server Tests / test-external-nodebug (push) Failing after 31s Details Coverity Scan / coverity (push) Has been skipped Details CI / build-macos-latest (push) Has been cancelled Details	2025-03-31 21:26:07 +03:00
Eran Hadad	1c646662e9	Bump module version to v7.99.90 for RedisBloom, JSON and Timeseries (#13908 )	2025-03-31 21:24:22 +03:00
Ozan Tezcan	366c6aff81	Put replica online when bgsave is done (#13895 ) Before https://github.com/redis/redis/pull/13732, replicas were brought online immediately after master wrote the last bytes of the RDB file to the socket. This behavior remains unchanged if rdbchannel replication is not used. However, with rdbchannel replication, the replica is brought online after receiving the first ack which is sent by replica after rdb is loaded. To align the behavior, reverting this change to put replica online once bgsave is done. Additonal changes: - INFO field `mem_total_replication_buffers` will also contain `server.repl_full_sync_buffer.mem_used` which shows accumulated replication stream during rdbchannel replication on replica side. - Deleted debug level logging from some replication tests. These tests generate thousands of keys and it may cause per key logging on some cases.	2025-03-31 13:48:49 +03:00
YaacovHazan	5d887c58ae	Merge unstable into 8.0 (#13901 ) CI / test-ubuntu-latest (push) Failing after 31s Details CI / test-sanitizer-address (push) Failing after 31s Details CI / build-32bit (push) Failing after 31s Details CI / build-libc-malloc (push) Failing after 31s Details CI / build-centos-jemalloc (push) Failing after 31s Details CI / build-debian-old (push) Failing after 51s Details CI / build-old-chain-jemalloc (push) Failing after 31s Details Codecov / code-coverage (push) Failing after 31s Details External Server Tests / test-external-standalone (push) Failing after 30s Details External Server Tests / test-external-cluster (push) Failing after 31s Details External Server Tests / test-external-nodebug (push) Failing after 31s Details Spellcheck / Spellcheck (push) Failing after 1m35s Details CI / build-macos-latest (push) Has been cancelled Details Merging unstable towards GA	2025-03-30 15:11:57 +03:00
Jason	aa8e2d1712	Ignore shardId updates from replica nodes (#13877 ) CI / test-ubuntu-latest (push) Failing after 31s Details CI / test-sanitizer-address (push) Failing after 31s Details CI / build-debian-old (push) Failing after 31s Details CI / build-32bit (push) Failing after 31s Details CI / build-libc-malloc (push) Failing after 31s Details CI / build-centos-jemalloc (push) Failing after 31s Details CI / build-old-chain-jemalloc (push) Failing after 31s Details Codecov / code-coverage (push) Failing after 31s Details Spellcheck / Spellcheck (push) Failing after 31s Details CI / build-macos-latest (push) Has been cancelled Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-standalone (push) Failing after 32s Details External Server Tests / test-external-cluster (push) Failing after 32s Details External Server Tests / test-external-nodebug (push) Failing after 2m18s Details Close https://github.com/redis/redis/issues/13868 This bug was introduced by https://github.com/redis/redis/pull/13468 ## Issue To maintain compatibility with older versions that do not support shardid, when a replica passes a shardid, we also update the master’s shardid accordingly. However, when both the master and replica support shardid, an issue arises: in one moment, the master may pass a shardid, causing us to update both the master and all its replicas to match the master’s shardid. But if the replica later passes a different shardid, we would then update the master’s shardid again, leading to continuous changes in shardid. ## Solution Regardless of the situation, we always ensure that the replica’s shardid remains consistent with the master’s shardid.	2025-03-30 15:15:04 +08:00
YaacovHazan	452b5b8a3b	Merge remote-tracking branch 'upstream/unstable' into HEAD	2025-03-30 09:54:48 +03:00
antirez	3dd48b5b45	README: Random Projection section.	2025-03-28 18:29:56 +01:00
Vitah Lin	057f039c4b	Fix 'RESTORE can set LFU' test (#13896 ) CI / test-ubuntu-latest (push) Failing after 31s Details CI / test-sanitizer-address (push) Failing after 31s Details CI / build-32bit (push) Failing after 31s Details CI / build-libc-malloc (push) Failing after 32s Details CI / build-centos-jemalloc (push) Failing after 32s Details CI / build-debian-old (push) Failing after 48s Details CI / build-old-chain-jemalloc (push) Failing after 31s Details Codecov / code-coverage (push) Failing after 31s Details Spellcheck / Spellcheck (push) Failing after 31s Details CodeQL / Analyze (cpp) (push) Failing after 31s Details CI / build-macos-latest (push) Has been cancelled Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-standalone (push) Failing after 31s Details External Server Tests / test-external-cluster (push) Failing after 31s Details External Server Tests / test-external-nodebug (push) Failing after 31s Details When the `restore foo 0 $encoded freq 100` command and `set freq [r object freq foo]` run in different minute timestamps (i.e., when server.unixtime/60 changes between these operations), the assertion may fail due to the LFU decay. This PR updates the “RESTORE can set LFU” test to verify the actual freq value based on minute timestamps. --------- Co-authored-by: debing.sun <debing.sun@redis.com>	2025-03-28 13:33:58 +08:00
antirez	4dca45ad24	Remove sprintf() from cJSON.	2025-03-27 12:19:56 +01:00
antirez	b17499f907	Fix projection output len.	2025-03-27 12:19:40 +01:00
antirez	29c27bc13e	Make HNSW CAS commit atomic. This way we don't need to mess with node->value at a latter time where an explicit lock would be required. Now we have: 1. Prepare context (neighbors). 2. Commit, and set the associated value.	2025-03-27 12:18:58 +01:00
antirez	c61c535c32	Make Redis module merging simpler. This way there is no need to change any file: the only needed change is the initialization function name, that is now controlled by the define.	2025-03-27 10:45:05 +01:00
antirez	2f17e4fb04	Prettify parseVector().	2025-03-27 08:35:47 +01:00
antirez	63057253d8	Document threading model in a top comment.	2025-03-27 08:31:15 +01:00
antirez	3d31fc3bee	VSIM thread: manipulate results while still locked.	2025-03-27 08:11:13 +01:00
debing.sun	87d8e71708	Fix defrag when type/encoding changes during scan (#13883 ) CI / build-32bit (push) Failing after 32s Details CI / test-sanitizer-address (push) Failing after 32s Details CI / build-libc-malloc (push) Failing after 32s Details CI / build-debian-old (push) Failing after 32s Details CI / build-centos-jemalloc (push) Failing after 31s Details CI / build-old-chain-jemalloc (push) Failing after 31s Details Codecov / code-coverage (push) Failing after 31s Details CI / test-ubuntu-latest (push) Failing after 2m7s Details Spellcheck / Spellcheck (push) Failing after 31s Details CI / build-macos-latest (push) Has been cancelled Details External Server Tests / test-external-standalone (push) Failing after 31s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-cluster (push) Failing after 32s Details External Server Tests / test-external-nodebug (push) Failing after 31s Details This PR is based on: https://github.com/valkey-io/valkey/pull/1801 [SoftlyRaining](https://github.com/SoftlyRaining) was hunting for defrag bugs with Jim and found a couple of improvements to make. Jim pointed out that in several of the callbacks, if the encoding were to change it simply returns without doing anything to `cursor` to make it reach 0, meaning that it would continue no-op working on that item without making any progress. Type and encoding can change while the defrag scan is in progress if the value is mutated or replaced by something else with the same key. --------- Signed-off-by: Rain Valentine <rsg000@gmail.com> Co-authored-by: Rain Valentine <rsg000@gmail.com>	2025-03-27 08:58:57 +08:00
antirez	f70dc8acb2	Clarify VRANDMEMBER tradeoff.	2025-03-26 23:47:47 +01:00
antirez	9180659f8b	Clarify failure behaior of VectorSetRdbLoad().	2025-03-26 23:44:39 +01:00
antirez	c2d80e8ced	Clarify that if CAS fails we insert blocking.	2025-03-26 23:41:55 +01:00
antirez	e3243819ef	Don't mess with node attributes without protection. The background VSIMs use the node attributes (via the callback) so we can't modify them without waiting for the background operations to terminate.	2025-03-26 23:36:14 +01:00
antirez	a6c8a15cad	VADD: fix leak on thread creation failure.	2025-03-26 22:50:47 +01:00
antirez	3e2649f1f1	hnsw_insert() should never fail in practice. We pass our aborting allocation function to the HNSW lib, the only other reason for it to fail is pthread mutex locking failing but this is also practically impossible AFAIK in modern systems, and if it happens (for kernel reosurces shortage) anyway to abort is the best thing to do: otherwise we would have to return that we could not complete the operation for some reason, which is not uniform with everything Redis does. In Redis under normal conditions writes must succeed if they are semantically correct, or the server crash for OOM.	2025-03-26 22:46:00 +01:00
Ozan Tezcan	a0da8390a2	Fix use-after-free when diskless load config is not swapdb (#13887 ) CI / build-macos-latest (push) Waiting to run Details CI / test-sanitizer-address (push) Failing after 32s Details CI / build-debian-old (push) Failing after 31s Details CI / build-centos-jemalloc (push) Failing after 32s Details CI / build-libc-malloc (push) Failing after 32s Details CI / build-32bit (push) Failing after 32s Details CI / build-old-chain-jemalloc (push) Failing after 32s Details Codecov / code-coverage (push) Failing after 31s Details External Server Tests / test-external-standalone (push) Failing after 32s Details External Server Tests / test-external-cluster (push) Failing after 32s Details External Server Tests / test-external-nodebug (push) Failing after 32s Details CI / test-ubuntu-latest (push) Failing after 1m37s Details Spellcheck / Spellcheck (push) Failing after 32s Details When the diskless load configuration is set to on-empty-db, we retain a pointer to the function library context. When emptyData() is called, it frees this function library context pointer, leading to a use-after-free situation. I refactored code to ensure that emptyData() is called first, followed by retrieving the valid pointer to the function library context. Refactored code should not introduce any runtime implications. Bug introduced by https://github.com/redis/redis/pull/13495 (Redis 8.0) Co-authored-by: Oran Agra <oran@redislabs.com>	2025-03-26 21:50:10 +03:00
antirez	8dfc501fb8	VSIM: fix double free if thread creation fails.	2025-03-26 19:43:59 +01:00
antirez	9d4325ee25	VSIM NOTHREAD, mainly for testing goals.	2025-03-26 16:52:28 +01:00
antirez	707c132392	Count threaded exec time in stats.	2025-03-26 16:48:02 +01:00
antirez	08e3f958fa	README: remove no longer valid RP issue. now the projection matrix is deterministic.	2025-03-26 11:33:32 +01:00
antirez	23b3e21817	README: suggest using FP32 vs VALUES.	2025-03-26 11:28:05 +01:00
Cong Chen	981aa5c12f	Fix timing issue in HEXPIREAT test (#13873 ) CI / build-macos-latest (push) Waiting to run Details CI / build-debian-old (push) Failing after 7s Details CI / build-centos-jemalloc (push) Failing after 3s Details CI / build-old-chain-jemalloc (push) Failing after 3s Details CI / build-32bit (push) Failing after 21s Details Codecov / code-coverage (push) Failing after 8s Details CI / build-libc-malloc (push) Successful in 50s Details CI / test-ubuntu-latest (push) Failing after 2m9s Details CI / test-sanitizer-address (push) Failing after 2m40s Details Spellcheck / Spellcheck (push) Successful in 9m2s Details External Server Tests / test-external-standalone (push) Failing after 32s Details External Server Tests / test-external-cluster (push) Failing after 32s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-nodebug (push) Failing after 31s Details This fixes an error that occurs in the job [test-valgrind-no-malloc-usable-size-test](https://github.com/redis/redis/actions/runs/13912357739/job/38929051397) of the Daily workflow: ``` *** [err]: HEXPIREAT - Set time and then get TTL (listpackex) in tests/unit/type/hash-field-expire.tcl Expected '999' to be between to '1000' and '2000' (context: type eval line 6 cmd {assert_range [r hpttl myhash FIELDS 1 field1] 1000 2000} proc ::test) ```	2025-03-26 10:00:38 +08:00
antirez	16e3c5a8f9	Locks error checking improved.	2025-03-24 19:10:28 +01:00
antirez	adfd2dc7c0	Remove useless OOM checks, but handle mutex creation failure.	2025-03-24 12:54:41 +01:00
antirez	8bf9b8abc1	Use Hadamard-based projection. Works better and being deterministic (only relative to the projection size) the replicas will have the same matrix automatically.	2025-03-24 12:48:04 +01:00
Oran Agra	2a189709e0	avoid possible use-after-free with module KSN changes (#13875 ) CI / build-debian-old (push) Failing after 4s Details CI / build-centos-jemalloc (push) Failing after 3s Details CI / build-old-chain-jemalloc (push) Failing after 3s Details CI / build-32bit (push) Failing after 18s Details CI / build-libc-malloc (push) Successful in 53s Details CI / test-sanitizer-address (push) Failing after 1m6s Details CI / test-ubuntu-latest (push) Failing after 2m57s Details Spellcheck / Spellcheck (push) Successful in 9m5s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-cluster (push) Failing after 31s Details External Server Tests / test-external-standalone (push) Failing after 6m35s Details External Server Tests / test-external-nodebug (push) Failing after 15m1s Details CI / build-macos-latest (push) Has been cancelled Details in #13505, we changed the code to use the string value of the key rather than the integer value on the stack, but we have a test in unit/moduleapi/keyspace_events that uses keyspace notification hook to modify the value with RM_StringDMA, which can cause this value to be released before used. the reason it didn't happen so far is because we were using shared integers, so releasing the object doesn't free it.	2025-03-24 12:24:52 +02:00
antirez	958ebee091	README: specify how to add REDUCE in VADD.	2025-03-24 09:55:45 +01:00
Yuan Wang	319bbcc1a7	Fix sdscatprintf error of the in output of `info stats` (#13871 ) CI / build-macos-latest (push) Waiting to run Details CI / build-debian-old (push) Failing after 4s Details CI / build-32bit (push) Failing after 15s Details CI / build-centos-jemalloc (push) Failing after 3s Details CI / build-old-chain-jemalloc (push) Failing after 2s Details CI / test-sanitizer-address (push) Failing after 1m2s Details Codecov / code-coverage (push) Failing after 33s Details CI / build-libc-malloc (push) Successful in 48s Details CI / test-ubuntu-latest (push) Failing after 2m51s Details Spellcheck / Spellcheck (push) Failing after 9s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-standalone (push) Failing after 33s Details External Server Tests / test-external-nodebug (push) Failing after 32s Details External Server Tests / test-external-cluster (push) Failing after 9m29s Details CI failed: https://github.com/redis/redis/actions/runs/13981749993/job/39148249096, since i don't reassign `info` after `sdscatprintf(info, xxx)` Thanks to @sundb for spotting this introduced in https://github.com/redis/redis/pull/13846	2025-03-24 09:17:58 +08:00
debing.sun	87b7c3ac1a	Fix rax node defragmentaion being skipped (#13847 ) First, when we do `raxSeek()` and then call raxNext, we will get the `RAX_ITER_JUST_SEEKED` flag and return success directly. We always set the node defrag callback after `raxSeek()`, which means that when we break from defragmentation, the first node that comes in again will never be defragged. In this PR, we save the last as the next node to be processed, not the last node to be completed. This way we defrag the next node when we exit to avoid it being skipped on the next resume. --------- Co-authored-by: oranagra <oran@redislabs.com>	2025-03-24 08:57:08 +08:00
antirez	8007ccd51b	Use RESP3-friendly bool replies.	2025-03-23 20:14:40 +01:00
antirez	9cc750fd66	Test: projection regression test fixed.	2025-03-23 15:04:58 +01:00
antirez	aa92b37589	VINFO: use a single field for random projection info.	2025-03-23 14:49:52 +01:00
antirez	8f479b22b9	Tests: replication test.	2025-03-23 14:45:34 +01:00
Salvatore Sanfilippo	854c7fdddb	Merge pull request #6 from rowantrollope/main Fix possible crash with random projection	2025-03-23 14:44:53 +01:00
Rowan Trollope	31bc07955c	Fix possible crash with random projection	2025-03-22 09:11:20 -07:00
antirez	f330d6175a	Clarify HNSW_MAX_THREADS vs one thread per request.	2025-03-20 15:42:11 +01:00
Benson-li	427c36888e	Fix potential infinite loop of RANDOMKEY during client pause (#13863 ) CI / test-ubuntu-latest (push) Failing after 31s Details CI / build-debian-old (push) Failing after 32s Details CI / build-libc-malloc (push) Failing after 31s Details CI / build-centos-jemalloc (push) Failing after 31s Details CI / build-old-chain-jemalloc (push) Failing after 32s Details Codecov / code-coverage (push) Failing after 32s Details Spellcheck / Spellcheck (push) Failing after 32s Details CI / test-sanitizer-address (push) Failing after 4m35s Details CI / build-32bit (push) Failing after 5m35s Details CI / build-macos-latest (push) Has been cancelled Details CodeQL / Analyze (cpp) (push) Failing after 32s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-standalone (push) Failing after 31s Details External Server Tests / test-external-cluster (push) Failing after 31s Details External Server Tests / test-external-nodebug (push) Failing after 6m47s Details The bug mentioned in this [#13862](https://github.com/redis/redis/issues/13862) has been fixed. --------- Signed-off-by: li-benson <1260437731@qq.com> Signed-off-by: youngmore1024 <youngmore1024@outlook.com> Co-authored-by: youngmore1024 <youngmore1024@outlook.com>	2025-03-20 21:32:12 +08:00
debing.sun	cb02bd190b	Fix timing issue in module defrag test (#13870 ) After #13840, the data we populate becomes more complex and slower, we always wait for a defragmentation cycle to end before verifying that the test is okay. However, in some slow environments, an entire defragmentation cycle can exceed 5 seconds, and in my local test using 'taskset -c 0' it can reach 6 seconds, so increase the threshold to avoid test failures.	2025-03-20 21:22:47 +08:00
Yuan Wang	951ec79654	Cluster compatibility check (#13846 ) CI / build-macos-latest (push) Waiting to run Details CI / build-32bit (push) Failing after 31s Details CI / build-libc-malloc (push) Failing after 31s Details CI / build-debian-old (push) Failing after 1m32s Details CI / build-old-chain-jemalloc (push) Failing after 31s Details Codecov / code-coverage (push) Failing after 31s Details CI / test-ubuntu-latest (push) Failing after 3m21s Details Spellcheck / Spellcheck (push) Failing after 31s Details CI / test-sanitizer-address (push) Failing after 6m36s Details CI / build-centos-jemalloc (push) Failing after 6m36s Details External Server Tests / test-external-standalone (push) Failing after 2m10s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-nodebug (push) Failing after 2m12s Details External Server Tests / test-external-cluster (push) Failing after 2m16s Details ### Background The program runs normally in standalone mode, but migrating to cluster mode may cause errors, this is because some cross slot commands can not run in cluster mode. We should provide an approach to detect this issue when running in standalone mode, and need to expose a metric which indicates the usage of no incompatible commands. ### Solution To avoid perf impact, we introduce a new config `cluster-compatibility-sample-ratio` which define the sampling ratio (0-100) for checking command compatibility in cluster mode. When a command is executed, it is sampled at the specified ratio to determine if it complies with Redis cluster constraints, such as cross-slot restrictions. A new metric is exposed: `cluster_incompatible_ops` in `info stats` output. The following operations will be considered incompatible operations. - cross-slot command If a command has multiple cross slot keys, it is incompatible - `swap, copy, move, select` command These commands involve multi databases in some cases, we don't allow multiple DB in cluster mode, so there are not compatible - Module command with `no-cluster` flag If a module command has `no-cluster` flag, we will encounter an error when loading module, leading to fail to load module if cluster is enabled, so this is incompatible. - Script/function with `no-cluster` flag Similar with module command, if we declare `no-cluster` in shebang of script/function, we also can not run it in cluster mode - `sort` command by/get pattern When `sort` command has `by/get` pattern option, we must ask that the pattern slot is equal with the slot of keys, otherwise it is incompatible in cluster mode. - The script/function command accesses the keys and declared keys have different slots For the script/function command, we not only check the slot of declared keys, but only check the slot the accessing keys, if they are different, we think it is incompatible. Besides, commands like `keys, scan, flushall, script/function flush`, that in standalone mode iterate over all data to perform the operation, are only valid for the server that executes the command in cluster mode and are not broadcasted. However, this does not lead to errors, so we do not consider them as incompatible commands. ### Performance impact test cross slot test Below are the test commands and results. When using MSET with 8 keys, performance drops by approximately 3%. single key test It may be due to the overhead of the sampling function, and single-key commands could cause a 1-2% performance drop.	2025-03-20 10:35:53 +08:00
Filipe Oliveira (Redis)	3e012c9260	Fix string2d usage in case of hexadecimal strings parsing and overflow (#13845 ) CI / build-macos-latest (push) Waiting to run Details CI / build-debian-old (push) Failing after 6s Details CI / build-centos-jemalloc (push) Failing after 5s Details CI / build-old-chain-jemalloc (push) Failing after 3s Details Codecov / code-coverage (push) Failing after 7s Details CI / build-libc-malloc (push) Successful in 56s Details CI / test-sanitizer-address (push) Failing after 1m8s Details CI / test-ubuntu-latest (push) Failing after 2m13s Details CI / build-32bit (push) Failing after 3m28s Details Coverity Scan / coverity (push) Has been skipped Details External Server Tests / test-external-nodebug (push) Failing after 1m48s Details External Server Tests / test-external-standalone (push) Failing after 2m9s Details External Server Tests / test-external-cluster (push) Failing after 2m14s Details Spellcheck / Spellcheck (push) Successful in 9m3s Details Since https://github.com/redis/redis/pull/11884, what was previously accepted as a valid input (hexadecimal string) before 8.0 returned an error. This PR addresses it. To avoid performance penalties if hints the compiler that the fallbacks are not likely to happen. Furthermore, we were ignoring std::result_out_of_range outputs from fast_float. This PR addresses it as well and includes tests for both identified scenarios. --------- Co-authored-by: debing.sun <debing.sun@redis.com>	2025-03-19 20:08:45 +08:00
antirez	758e963a4e	VRANDMEMBER documentation.	2025-03-19 09:02:15 +01:00

1 2 3 4 5 ...

12620 Commits All Branches Search

12620 Commits

All Branches