redis

Commit Graph

Author	SHA1	Message	Date
antirez	519c9e11d1	Allow PUBSUB NUMSUB without channels. The result is an empty list but it is handy to call it programmatically.	2013-06-20 15:34:56 +02:00
antirez	455563faec	PUBSUB command implemented. Currently it implements three subcommands: PUBSUB CHANNELS [<pattern>] List channels with non-zero subscribers. PUBSUB NUMSUB [channel_1 ...] List number of subscribers for channels. PUBSUB NUMPAT Return number of subscribed patterns.	2013-06-20 15:32:00 +02:00
antirez	4c0f8c4e5a	Sentinel: parse new INFO replication output correctly. Sentinel was not able to detect slaves when connected to a very recent version of Redis master since a previos non-backward compatible change to INFO broken the parsing of the slaves ip:port INFO output. This fixes issue #1164	2013-06-20 10:23:23 +02:00
antirez	d363299af3	Allow writes from scripts called by AOF loading in read-only slaves. This fixes issue #1163	2013-06-19 18:25:03 +02:00
antirez	338cd4835d	Fix logStackTrace() when logging to stdout. When the semantics changed from logfile = NULL to logfile = "" to log into standard output, no proper change was made to logStackTrace() to make it able to work with the new setup. This commit fixes the issue.	2013-06-19 14:44:40 +02:00
antirez	9c2c878e45	Lua script errors format more unified. lua_pcall error handler now formats errors in a way more similar to luaPushError() so that errors generated in different contexts look alike.	2013-06-18 19:30:56 +02:00
antirez	51adc6e1bc	Lua scripting: improve error reporting. When calling Lua scripts we try to report not just the error but information about the code line causing the error.	2013-06-18 17:33:35 +02:00
Salvatore Sanfilippo	b96ba52cfa	Merge pull request #1124 from ioddly/fix-issue-1121 Try to report source of bad Lua API calls	2013-06-17 03:14:31 -07:00
antirez	dfc98dccf4	Cluster: detect nodes address change.	2013-06-12 10:50:07 -07:00
antirez	d427373f01	clusterProcessPacket() comments improved for correctness.	2013-06-11 21:34:34 +02:00
antirez	cf71b82111	Binary safe dump of object content in redisLogObjectDebugInfo().	2013-06-04 15:53:53 +02:00
antirez	b6a2878aa5	CONFIG SET: accept slave-priority zero, it is valid.	2013-05-31 19:31:36 +02:00
antirez	915c06a96c	Refresh good slaves count after CONFIG SET min-slaves-... This way just after the CONFIG SET enabling the min-slaves feature it is possible to write to the database without delays.	2013-05-30 12:23:41 +02:00
antirez	88441bf18f	New INFO field "min_slaves_good_slaves". When min-slaves-to-write feature is active, this field reports the number of slaves considered good (online state, lag within the specified range).	2013-05-30 12:18:31 +02:00
antirez	1a54d5963e	Refresh good slaves count when setting slave state as online.	2013-05-30 12:13:25 +02:00
antirez	2ec7875cbf	min-replicas-to-write: only deny write commands. I guess I needed another coffee...	2013-05-30 11:30:09 +02:00
antirez	ed599d3aca	min-slaves-to-write: don't accept writes with less than N replicas. This feature allows the user to specify the minimum number of connected replicas having a lag less or equal than the specified amount of seconds for writes to be accepted.	2013-05-30 11:30:04 +02:00
antirez	888400ebd5	repl_offset field in INFO replication is now just offset.	2013-05-29 19:56:33 +02:00
antirez	37c29e037b	Slaves list in INFO output: lag added, format changed. There is a new 'lag' information in the list of slaves, in the "replication" section of the INFO output. Also the format was changed in a backward incompatible way in order to make it more easy to parse if new fields are added in the future, as the new format is comma separated but has named fields (no longer positional fields).	2013-05-29 19:54:44 +02:00
antirez	091ed386f7	Accept REPLCONF in any state.	2013-05-28 15:26:20 +02:00
antirez	a864cae2a5	A comment about BLPOP timeout did not reflected actual behavior.	2013-05-27 19:34:14 +02:00
antirez	1909b8753d	Version bumped to 2.9.11	2013-05-27 11:44:04 +02:00
antirez	3c82c85fcf	Close connection with timedout slaves. Now masters, using the time at which the last REPLCONF ACK was received, are able to explicitly disconnect slaves that are no longer responding. Previously the only chance was to see a very long output buffer, that was highly suboptimal.	2013-05-27 11:42:42 +02:00
antirez	e06a560466	Send ACK to master once every second. ACKs can be also used as a base for synchronous replication. However in that case they'll be explicitly requested by the master when the client sends a request that needs to be replicated synchronously.	2013-05-27 11:42:38 +02:00
antirez	efd87031d0	Don't ACK the master after every command. Sending an ACK is now moved into the replicationSendAck() function.	2013-05-27 11:42:35 +02:00
antirez	dd0adbb777	Make sure that REPLCONF ACK really has no return value.	2013-05-27 11:42:30 +02:00
antirez	0292c5f7ae	Replication: send REPLCONF ACK to master.	2013-05-27 11:42:25 +02:00
antirez	6b4635f4f5	REPLCONF ACK command. This special command is used by the slave to inform the master the amount of replication stream it currently consumed. it does not return anything so that we not need to consume additional bandwidth needed by the master to reply something. The master can do a number of things knowing the amount of stream processed, such as understanding the "lag" in bytes of the slave, verify if a given command was already processed by the slave, and so forth.	2013-05-27 11:42:17 +02:00
antirez	a950f48906	Fixed a bug in no queueing replies to master.	2013-05-25 01:00:07 +02:00
antirez	dd937030f0	Replication: don't even queue replies to master commands. When master send commands, there is no need for the slave to reply. Redis used to queue the reply in the output buffer and discard the reply later, this is a waste of work and it is not clear why it was this way (I sincerely don't remember). This commit changes it in order to don't queue the reply at all. All tests passing.	2013-05-24 18:58:57 +02:00
antirez	c853239a5e	Top comment for prepareClientToWrite() clarified. We don't write the output buffer to the client socket for slaves only if the slave is not online.	2013-05-24 18:41:43 +02:00
ioddly	c3229c33b1	Try to report source of bad Lua API calls	2013-05-22 18:17:58 -05:00
antirez	5d0f408d42	Don't stop reading redis.conf if line has no args. Should be "continue" and was "return". This fixes issue #1110	2013-05-18 16:21:52 +02:00
YAMAMOTO Takashi	9fcead7a59	don't assume time_t == long time_t is always 64bit on recent versions of NetBSD.	2013-05-17 17:22:39 +09:00
YAMAMOTO Takashi	ae82867664	use nanosleep instead of usleep SUSv3 says that: The useconds argument shall be less than one million. If the value of useconds is 0, then the call has no effect. and actually NetBSD's implementation rejects such a value with EINVAL. use nanosleep which has no such a limitation instead.	2013-05-17 17:21:28 +09:00
YAMAMOTO Takashi	b2dd0849ce	rename popcount to popcount_binary to avoid a conflict with NetBSD libc NetBSD-current's libc has a function named popcount. hiding these extensions using feature macros is not possible because redis uses other extensions covered by the same feature macro. eg. inet_aton	2013-05-17 17:21:28 +09:00
YAMAMOTO Takashi	0a20ad40de	don't define _XOPEN_SOURCE for NetBSD on NetBSD, defining _XOPEN_SOURCE hides extensions like inet_aton, strcasecmp, etc.	2013-05-17 17:19:02 +09:00
antirez	92f18d04a0	Merge branch 'config-rewrite' into unstable	2013-05-15 11:58:21 +02:00
antirez	78167807be	Use memtoll() when parsing the backlog-size option.	2013-05-15 11:55:14 +02:00
antirez	25f2be6267	CONFIG REWRITE: backlog size is a bytes option.	2013-05-15 11:39:29 +02:00
antirez	cd48e4fc40	CONFIG REWRITE: bindaddr -> bind.	2013-05-15 11:38:43 +02:00
antirez	8a44e6c490	CONFIG REWRITE: when rewriting amount of bytes use GB, MB, KB if possible.	2013-05-15 11:33:02 +02:00
antirez	c590e18d15	CONFIG REWRITE: correctly escape the notify-keyspace-events option.	2013-05-15 11:15:31 +02:00
antirez	328843849f	CONFIG REWRITE: "active-rehashing" -> "activerehashing".	2013-05-15 11:09:19 +02:00
antirez	402a0f1ca0	CONFIG REWRITE: fixed typo in AOF fsync policy.	2013-05-15 11:06:56 +02:00
antirez	72e980231c	CONFIG REWRITE: repl-disable-tcp-nodelay is a boolean option.	2013-05-15 11:04:53 +02:00
antirez	310dbba01c	Added a define for most configuration defaults. Also the logfile option was modified to always have an explicit value and to log to stdout when an empty string is used as log file. Previously there was special handling of the string "stdout" that set the logfile to NULL, this always required some special handling.	2013-05-15 10:12:29 +02:00
antirez	9e74d216e1	CONFIG REWRITE: Use sane perms when creating redis.conf from scratch.	2013-05-14 12:45:04 +02:00
antirez	e252045454	CONFIG REWRITE: actually rewrite the config file, atomically.	2013-05-14 12:32:25 +02:00
antirez	25e049cc68	redis-cli: help.h updated.	2013-05-14 11:23:16 +02:00
antirez	78f94d90eb	CONFIG REWRITE: remove orphaned lines.	2013-05-14 11:17:18 +02:00
antirez	ee721f1e5c	CONFIG REWRITE: strip multiple empty lines.	2013-05-14 10:22:55 +02:00
antirez	c184f36d21	CONFIG REWRITE: support for client-output-buffer-limit.	2013-05-13 18:34:18 +02:00
antirez	d95592b116	CONFIG REWRITE: support for dir and slaveof.	2013-05-13 11:26:49 +02:00
antirez	2bc31e55ee	CONFIG REWRITE: support for save and syslog-facility.	2013-05-13 11:11:45 +02:00
antirez	7e049fafd3	CONFIG REWRITE: Initial support code and design.	2013-05-13 11:11:12 +02:00
antirez	5947f170f9	Obtain absoute path of configuration file, expose it in INFO.	2013-05-09 16:57:59 +02:00
antirez	a7486a6592	Version bumped to 2.9.10	2013-05-08 15:24:40 +02:00
antirez	0ae1b5b0a1	Revert "use long long instead of size_t make it more safe" This reverts commit `2c75f2cf1a`. After further analysis, it is very unlikely that we'll raise the string size limit to > 512MB, and at the same time such big strings will be used in 32 bit systems. Better to revert to size_t so that 32 bit processors will not be forced to use a 64 bit counter in normal operations, that is currently completely useless.	2013-05-08 10:01:27 +02:00
Jiahao Huang	2c75f2cf1a	use long long instead of size_t make it more safe	2013-05-07 23:37:22 +08:00
Jiahao Huang	e3ed78d43b	in 32bit machine, popcount don't work with a input string length up to 512 MB, bitcount commant may return negtive integer with string length more than 256 MB	2013-05-07 19:55:57 +08:00
antirez	5c9f6d4f55	Cluster: link reconnection on delayed PONG reply. When the PONG delay is half the cluster node timeout, the link gets disconnected (and later automatically reconnected) in order to ensure that it's not just a dead connection issue. However this operation is only performed if the link is old enough, in order to avoid to disconnect the same link again and again (and among the other problems, never receive the PONG because of that). Note: when the link is reconnected, the 'ping_sent' field is not updated even if a new ping is sent using the new connection, so we can still reliably detect a node ping timeout.	2013-05-03 15:43:03 +02:00
antirez	1315b9f246	Cluster: restore PING sent time on reconnections.	2013-05-03 15:42:59 +02:00
antirez	ae71731019	Cluster: PING/PONG handling redesigned.	2013-05-03 15:42:38 +02:00
antirez	a120560f70	Cluster: process config from PING packets as we do for PONG. Also clusterBroadcastPing() was renamed into clusterBroadcastPong() that's what the function is actually doing.	2013-05-03 15:41:34 +02:00
antirez	8a51c067ad	Cluster: createClusterLink() comment fixed for grammar.	2013-05-03 15:41:29 +02:00
antirez	0d9c1f536b	CONFIG SET server.masterauth aesthetic change. This is just to make the code exactly like the above instance used for requirepass. No actual change nor the original code violated the Redis coding style.	2013-05-02 17:20:49 +02:00
Michel Martens	649b304e0f	Reset masterauth if an empty string is configured.	2013-05-02 17:19:11 +02:00
charsyam	9cd06e4406	Fix AOF bug: expire could be removed from key on AOF rewrite. There was a race condition in the AOF rewrite code that, with bad enough timing, could cause a volatile key just about to expire to be turned into a non-volatile key. The bug was never reported to cause actualy issues, but was found analytically by an user in the Redis mailing list: https://groups.google.com/forum/?fromgroups=#!topic/redis-db/Kvh2FAGK4Uk This commit fixes issue #1079.	2013-05-02 15:35:59 +02:00
antirez	e5ef85c444	Sentinel: changes to tilt mode. Tilt mode was too aggressive (not processing INFO output), this resulted in a few problems: 1) Redirections were not followed when in tilt mode. This opened a window to misinform clients about the current master when a Sentinel was in tilt mode and a fail over happened during the time it was not able to update the state. 2) It was possible for a Sentinel exiting tilt mode to detect a false fail over start, if a slave rebooted with a wrong configuration about at the same time. This used to happen since in tilt mode we lose the information that the runid changed (reboot). Now instead the Sentinel in tilt mode will still remove the instance from the list of slaves if it changes state AND runid at the same time. Both are edge conditions but the changes should overall improve the reliability of Sentinel.	2013-04-30 15:08:29 +02:00
antirez	ef05a78e7e	Sentinel: more sensible delay in master demote after tilt.	2013-04-30 15:08:22 +02:00
antirez	48ede0d84d	Sentinel: only demote old master into slave under certain conditions. We used to always turn a master into a slave if the DEMOTE flag was set, as this was a resurrecting master instance. However the following race condition is possible for a Sentinel that got partitioned or internal issues (tilt mode), and was not able to refresh the state in the meantime: 1) Sentinel X is running, master is instance "A". 3) "A" fails, sentinels will promote slave "B" as master. 2) Sentinel X goes down because of a network partition. 4) "A" returns available, Sentinels will demote it as a slave. 5) "B" fails, other Sentinels will promote slave "A" as master. 6) At this point Sentinel X comes back. When "X" comes back he thinks that: "B" is the master. "A" is the slave to demote. We want to avoid that Sentinel "X" will demote "A" into a slave. We also want that Sentinel "X" will detect that the conditions changed and will reconfigure itself to monitor the right master. There are two main ways for the Sentinel to reconfigure itself after this event: 1) If "B" is reachable and already configured as a slave by other sentinels, "X" will perform a redirection to "A". 2) If there are not the conditions to demote "A", the fact that "A" reports to be a master will trigger a failover detection in "X", that will end into a reconfiguraiton to monitor "A". However if the Sentinel was not reachable, its state may not be updated, so in case it titled, or was partiitoned from the master instance of the slave to demote, the new implementation waits some time (enough to guarantee we can detect the new INFO, and new DOWN conditions). If after some time still there are not the right condiitons to demote the instance, the DEMOTE flag is cleared.	2013-04-26 17:02:13 +02:00
antirez	1965e22aa1	Sentinel: always redirect on master->slave transition. Sentinel redirected to the master if the instance changed runid or it was the first time we got INFO, and a role change was detected from master to slave. While this is a good idea in case of slave->master, since otherwise we could detect a failover without good reasons just after a reboot with a slave with a wrong configuration, in the case of master->slave transition is much better to always perform the redirection for the following reasons: 1) A Sentinel may go down for some time. When it is back online there is no other way to understand there was a failover. 2) Pointing clients to a slave seems to be always the wrong thing to do. 3) There is no good rationale about handling things differently once an instance is rebooted (runid change) in that case.	2013-04-24 11:30:17 +02:00
antirez	d264122f6a	Config option to turn AOF rewrite incremental fsync on/off.	2013-04-24 10:57:07 +02:00
antirez	336d722fba	AOF: sync data on disk every 32MB when rewriting. This prevents the kernel from putting too much stuff in the output buffers, doing too heavy I/O all at once. So the goal of this commit is to split the disk pressure due to the AOF rewrite process into smaller spikes. Please see issue #1019 for more information.	2013-04-24 10:26:31 +02:00
antirez	91f4213ddf	rio.c: added ability to fdatasync() from time to time while writing.	2013-04-24 10:26:30 +02:00
antirez	8e222c888f	Sentinel: turn old master into a slave when it comes back.	2013-04-19 16:47:24 +02:00
antirez	9d823fc222	More explicit panic message on out of memory.	2013-04-19 15:11:34 +02:00
xiaost7	ecdbaf4695	Cluster: fix clusterNode.name print format on debug message. It was %40s instead of %.40s, and since the string is not null terminated it caused random garbage to be displayed, and possibly a crash.	2013-04-19 09:53:43 +02:00
antirez	f8ae70cf7c	redis-cli: raise error on bad command line switch. Previously redis-cli never tried to raise an error when an unrecognized switch was encountered, as everything after the initial options is to be transmitted to the server. However this is too liberal, as there are no commands starting with "-". So the new behavior is to produce an error if there is an unrecognized switch starting with "-". This should not break past redis-cli usages but should prevent broken options to be silently discarded. As far the first token not starting with "-" is encountered, all the rest is considered to be part of the command, so you cna still use strings starting with "-" as values, like in: redis-cli --port 6380 set foo --my-value	2013-04-11 13:17:25 +02:00
antirez	0280c2f252	redis-cli: --latency-history mode implemented.	2013-04-11 13:11:41 +02:00
antirez	b84570dece	Cluster: reconfigure additonal slaves on failover.	2013-04-09 12:13:26 +02:00
antirez	d1aee359c0	Cluster: CONFIG SET cluster-node-timeout.	2013-04-09 11:29:51 +02:00
antirez	68cf249f81	Cluster: use server.cluster_node_timeout directly. We used to copy this value into the server.cluster structure, however this was not necessary. The reason why we don't directly use server.cluster->node_timeout is that things that can be configured via redis.conf need to be directly available in the server structure as server.cluster is allocated later only if needed in order to reduce the memory footprint of non-cluster instances.	2013-04-09 11:24:18 +02:00
antirez	ef4f25ff6e	Cluster: configdigest field no longer used. Removed.	2013-04-09 11:07:25 +02:00
antirez	f09b2508f4	Cluster: properly send ping to nodes not pinged foro too much time. In commit `d728ec6` it was introduced the concept of sending a ping to every node not receiving a ping since node_timeout/2 seconds. However the code was located in a place that was not executed because of a previous conditional causing the loop to re-iterate. This caused false positives in nodes availability detection. The current code is still not perfect as a node may be detected to be in PFAIL state even if it does not reply for just node_timeout/2 seconds that is not correct. There is a plan to improve this code ASAP.	2013-04-08 19:40:20 +02:00
antirez	d5b383477e	Cluster: move REDIS_CLUSTER_FAILOVER_DELAY near other timing defines.	2013-04-04 14:23:34 +02:00
antirez	3cc6e7d01d	Cluster: CONFIG GET cluster-node-timeout.	2013-04-04 14:21:01 +02:00
antirez	05fa4f4034	Cluster: node timeout is now configurable.	2013-04-04 12:29:10 +02:00
antirez	00bab23c41	Cluster: turn hardcoded node timeout multiplicators into defines. Most Redis Cluster time limits are expressed in terms of the configured node timeout. Turn them into defines.	2013-04-04 12:04:11 +02:00
antirez	8419397665	Make rio.c comment 80-columns friendly.	2013-04-03 12:41:14 +02:00
antirez	b237de33d1	Throttle BGSAVE attempt on saving error. When a BGSAVE fails, Redis used to flood itself trying to BGSAVE at every next cron call, that is either 10 or 100 times per second depending on configuration and server version. This commit does not allow a new automatic BGSAVE attempt to be performed before a few seconds delay (currently 5). This avoids both the auto-flood problem and filling the disk with logs at a serious rate. The five seconds limit, considering a log entry of 200 bytes, will use less than 4 MB of disk space per day that is reasonable, the sysadmin should notice before of catastrofic events especially since by default Redis will stop serving write queries after the first failed BGSAVE. This fixes issue #849	2013-04-02 14:05:50 +02:00
antirez	b14fda7deb	Version bumped to 2.9.9.	2013-04-02 11:55:23 +02:00
Salvatore Sanfilippo	63457d1a5d	Merge pull request #1017 from jbergstroem/build-improvements Build improvements	2013-04-02 02:24:52 -07:00
charsyam	86d87e3554	Support for case unsensitive SET options.	2013-03-29 10:33:52 +01:00
antirez	30d5d416e6	Extended SET command implemented (issue #931 ).	2013-03-28 15:40:19 +01:00
antirez	140260409e	EXPIRE should not resurrect keys. Issue #1026 .	2013-03-28 12:45:07 +01:00
antirez	3816e7bda9	Better DEBUG error message when num of arguments is wrong.	2013-03-28 11:39:29 +01:00
antirez	32a83c8206	DEBUG set-active-expire added. We need the ability to disable the activeExpireCycle() (active expired key collection) call for testing purposes.	2013-03-27 17:55:02 +01:00
antirez	4fa30b781e	redis-trib: ClusterNode #info_string output modified. The hope is that the new one is more readable.	2013-03-26 18:16:03 +01:00
antirez	df69155e8a	Allow SELECT while loading the DB. Fixes issue #1024.	2013-03-26 13:51:17 +01:00
antirez	873f328fd8	TTL / PTTL commands: two bugs fixed. This commit fixes two corner cases for the TTL command. 1) When the key was already logically expired (expire time older than current time) the command returned -1 instead of -2. 2) When the key was existing and the expire was found to be exactly 0 (the key was just about to expire), the command reported -1 (that is, no expire) instead of a TTL of zero (that is, about to expire).	2013-03-26 11:45:22 +01:00
antirez	8bb5eb7357	Flag PUBLISH as read-only in the command table.	2013-03-26 11:09:22 +01:00
antirez	71f3e74317	Transactions: propagate MULTI/EXEC only when needed. MULTI/EXEC is now propagated to the AOF / Slaves only once we encounter the first command that is not a read-only one inside the transaction. The old behavior was to always propagate an empty MULTI/EXEC block when the transaction was composed just of read only commands, or even completely empty. This created two problems: 1) It's a bandwidth waste in the replication link and a space waste inside the AOF file. 2) We used to always increment server.dirty to force the propagation of the EXEC command, resulting into triggering RDB saves more often than needed. Note: even read-only commands may also trigger writes that will be propagated, when we access a key that is found expired and Redis will synthesize a DEL operation. However there is no need for this to stay inside the transaction itself, but only to be ordered. So for instance something like: MULTI GET foo SET key zap EXEC May be propagated into: DEL foo MULTI SET key zap EXEC While the DEL is outside the transaction, the commands are delivered in the right order and it is not possible for other commands to be inserted between DEL and MULTI.	2013-03-26 10:58:10 +01:00
antirez	02c269e27e	Transactions: use discardTransaction() in EXEC implementation.	2013-03-26 10:48:15 +01:00
antirez	2f49734029	Transactions: use the propagate() API to propagate MULTI. The behavior is the same, but the code is now cleaner and uses the proper interface instead of dealing directly with AOF/replication functions.	2013-03-26 10:27:45 +01:00
Salvatore Sanfilippo	afce010639	Merge pull request #1003 from NanXiao/patch-3 Update config.c	2013-03-25 11:30:59 -07:00
antirez	c39e34d007	Cluster: when slave changes master, remove it from the old master.	2013-03-25 15:01:25 +01:00
antirez	70091a2f63	redis-trib: wait cluster join after cluster creation.	2013-03-25 13:14:17 +01:00
antirez	34c1871e9f	Cluster: set node role on successful handshake.	2013-03-25 13:03:01 +01:00
antirez	cda0cdfb70	redis-trib: Don't use colorization if TERM != xterm.	2013-03-25 12:51:53 +01:00
antirez	4286d8b979	redis-trib: initial output colorization	2013-03-25 12:50:38 +01:00
antirez	09aa55a334	redis-cli --stat, stolen from redis-tools. Redis-tools is a connection of tools no longer mantained that was intented as a way to economically make sense of Redis in the pre-vmware sponsorship era. However there was a nice redis-stat utility, this commit imports one of the functionalities of this tool here in redis-cli as it seems to be pretty useful. Usage: redis-cli --stat The output is similar to vmstat in the format, but with Redis specific stuff of course. From the point of view of the monitored instance, only INFO is used in order to grab data.	2013-03-22 17:54:32 +01:00
antirez	c195289e5e	redis-trib: All output wrapped by a specific function. This is needed in order to colorize it as next step. We use conventions in output messages such as >>> This is an action *** This is a warning [ERR] This is an error [OK] That's fine And so forth, so that a color will be associated checking the first three chars.	2013-03-22 17:39:43 +01:00
antirez	be7bdd376e	redis-trib: fix open slot correction. Slot zero was hardcoded (!)	2013-03-22 13:03:33 +01:00
antirez	813d7cbdd1	redis-trib: added cluster_error method to add errors.	2013-03-22 12:59:18 +01:00
antirez	d7eae8d8d7	redis-trib: fixed ClusterNode migrating/importing slots detection.	2013-03-22 12:54:04 +01:00
antirez	1baa153028	redis-trib: added some more output for check.	2013-03-22 12:47:49 +01:00
antirez	77bba91b48	redis-trib: fixed type has_flags? -> has_flag.	2013-03-22 12:28:06 +01:00
antirez	4dded0c187	redis-trib: ignore slaves when resharding.	2013-03-21 18:17:06 +01:00
antirez	47da76576e	redis-trib: fix conditional otherwise always true.	2013-03-21 17:22:14 +01:00
antirez	a89d435d8e	Cluster: move slotToKeyFlush() to emptyDb(). This way we are sure to destroy the slot->key map every time we destroy the DB, for instance when reloading a DB due to replication.	2013-03-21 17:13:08 +01:00
antirez	ed47f77977	redis-trib: initial support to fix "open" slots. Open slots are slots found in importing or migrating slot when a cluster check is performed.	2013-03-21 17:11:54 +01:00
antirez	1a6df1049d	redis-trib: load info about importing/migrating slots from node.	2013-03-21 16:31:53 +01:00
antirez	a8b09faf3d	Cluster: comment no longer in sync with code removed.	2013-03-21 10:47:10 +01:00
Johan Bergström	8080e1cfbe	use `install` as default installer (except on SunOS)	2013-03-21 13:32:08 +11:00
antirez	8c1bc8e865	Cluster: clear the PROMOTED slave directly into clusterSetMaster(). This way we make sure every time a master is turned into a replica the flag will be cleared.	2013-03-20 11:51:44 +01:00
antirez	e006407fd0	Cluster: master node must clear its hash slots when turning into a slave. When a master turns into a slave after a failover event, make sure to clear the assigned slots before setting up the replication, as a slave should never claim slots in an explicit way, but just take over the master slots when replacing its master.	2013-03-20 11:32:35 +01:00
antirez	506f9a42b0	Cluster: new flag PROMOTED introduced. A slave node set this flag for itself when, after receiving authorization from the majority of nodes, it turns itself into a master. At the same time now this flag is tested by nodes receiving a PING message before reconfiguring after a failover event. This makes the system more robust: even if currently there is no way to manually turn a slave into a master it is possible that we'll have such a feature in the future, or that simply because of misconfiguration a node joins the cluster as master while others believe it's a slave. This alone is now no longer enough to trigger reconfiguration as other nodes will check for the PROMOTED flag. The PROMOTED flag is cleared every time the node is turned back into a replica of some other node.	2013-03-20 10:48:42 +01:00
antirez	026b9483db	Cluster: add sender flags in cluster bus messages header. Sender flags were not propagated for the sender, but only for nodes in the gossip section. This is odd and in the next commits we'll need to get updated flags for the sender node, so this commit adds a new field in the cluster messages header. The message header is the same size as we reused some free space that was marked as 'unused' because of alignment concerns.	2013-03-20 10:32:00 +01:00
antirez	d15b027d91	Cluster: turn old master into a replica of node that failed over. So when the failing master node is back in touch with the cluster, instead of remaining unused it is converted into a replica of the new master, ready to perform the fail over if the new master node will fail at some point. Note that as a side effect clients with stale configuration are now not an issue as well, as the node converted into a slave will not accept queries but will redirect clients accordingly.	2013-03-20 00:30:47 +01:00
antirez	4d62623015	Cluster: node replication role change handle improved. The code handling a master that turns into a slave or the contrary was improved in order to avoid repeating the same operations. Also the readability and conceptual simplicity was improved.	2013-03-19 16:01:30 +01:00
antirez	88221f88c0	Cluster: new command CLUSTER FLUSHSLOTS. It's just a simpler way to CLUSTER DELSLOTS with all the slots as arguments, in order to obtain a node without assigned slots for reconfiguration.	2013-03-19 09:58:05 +01:00
antirez	7d3e32d526	redis-trib: don't load cluster config from nodes in FAIL state.	2013-03-19 09:46:12 +01:00
Johan Bergström	dc4003be71	Silence mkdir output	2013-03-17 18:37:38 +11:00
Johan Bergström	348be19b5f	Only pass -rdynamic as linker option	2013-03-17 17:49:57 +11:00
Johan Bergström	33a4bc2c70	Remove extra spaces	2013-03-17 17:23:45 +11:00
Johan Bergström	978c895b69	make check is a common naming convention for tests	2013-03-16 18:40:22 +11:00
Johan Bergström	ada7aa7ac9	Spaces to tabs	2013-03-16 18:35:20 +11:00
Johan Bergström	bea60bec75	Slightly refactor CFLAGS/LDFLAGS/LIBS This way, we can avoid -rdynamic and -pthread warnings on darwin.	2013-03-16 18:33:42 +11:00
antirez	e28e61e839	Cluster: when failing over claim master slots.	2013-03-15 16:53:41 +01:00
antirez	b8127e337a	Version incremented to 2.9.8 after major cluster progresses.	2013-03-15 16:45:45 +01:00
antirez	dd091661d4	Cluster: log when a slave asks for failover authorization.	2013-03-15 16:44:08 +01:00
antirez	1375b0611b	Cluster: slaves start failover with a small delay. Redis Cluster can cope with a minority of nodes not informed about the failure of a master in time for some reason (netsplit or node not functioning properly, blocked, ...) however to wait a few seconds before to start the failover will make most "normal" failovers simpler as the FAIL message will propagate before the slave election happens.	2013-03-15 16:39:49 +01:00
antirez	d512a09c20	Cluster: a bit more serious node role change handling.	2013-03-15 16:35:16 +01:00
antirez	004fbef847	Cluster: remove node from master slaves when it turns into a master. Also, a few nearby comments improved.	2013-03-15 16:16:19 +01:00
antirez	44c92f5aeb	Cluster: slave failover implemented.	2013-03-15 16:11:34 +01:00
antirez	1d8f302e0d	Cluster: election -> promotion in two comments.	2013-03-15 15:44:49 +01:00
antirez	bf82195467	Cluster: added function to broadcast pings. See the function top-comment for info why this is useful sometimes.	2013-03-15 15:43:58 +01:00
antirez	892e98548a	Cluster: don't broadcast messages to HANDSHAKE nodes. Also don't check for NOADDR as we check that node->link is not NULL that's enough.	2013-03-15 15:36:36 +01:00
antirez	76a3954f4a	Cluster: fix clusterHandleSlaveFailover() conditional: quorum is enough.	2013-03-15 13:20:34 +01:00
antirez	90e99a2082	Cluster: two lame bugs fixed in FAILOVER AUTH messages generation.	2013-03-14 21:27:12 +01:00
antirez	aeacaa57e6	Cluster: code to process messages moved in the right if-else chain.	2013-03-14 21:21:58 +01:00
antirez	35f05c66b6	Cluster: handle FAILOVER_AUTH_ACK messages. That's trivial as we just need to increment the count of masters that received with an ACK.	2013-03-14 16:43:13 +01:00
antirez	c2595500ac	Cluster: request failover authorization, log if we have quorum. However the failover is yet not really performed.	2013-03-14 16:39:02 +01:00
antirez	7fa42b801d	Cluster: clusterSendFailoverAuth() implementation.	2013-03-14 16:31:57 +01:00
NanXiao	79a13b46fb	Update config.c Fix bug in configGetCommand function: get correct masterauth value.	2013-03-14 13:52:43 +08:00
antirez	f59ff6fe61	Cluster: clusterSendFailoverAuthIfNeeded() work in progress.	2013-03-13 19:08:03 +01:00
antirez	44f6fdab60	Cluster: handle FAILOVER_AUTH_REQUEST in clusterProcessPacket(). However currently the control is passed to a function doing nothing at all.	2013-03-13 18:38:08 +01:00
antirez	ece95b2dea	Cluster: sanity check FAILOVER_AUTH_REQUEST messages for proper length.	2013-03-13 17:31:26 +01:00
antirez	66144337bf	Cluster: use 'else if' for mutually exclusive conditionals.	2013-03-13 17:27:06 +01:00
antirez	db7c17e969	Cluster: FAILOVER_AUTH_REQUEST message type introduced. This message is sent by a slave that is ready to failover its master to other nodes to get the authorization from the majority of masters.	2013-03-13 17:21:20 +01:00
antirez	575cbc9990	Cluster: clusterHandleSlaveFailover() stub.	2013-03-13 13:10:49 +01:00
antirez	1902a9c532	Replication: master_link_down_since_seconds initial value should be huge. server.repl_down_since used to be initialized to the current time at startup. This is wrong since the replication never started. Clients testing this filed to check if data is uptodate should never believe data is recent if we never ever connected to our master.	2013-03-13 12:54:48 +01:00
antirez	3d448bda39	Cluster: call clusterHandleSlaveFailover() when our master is down.	2013-03-13 12:44:02 +01:00
antirez	79a6844e44	rdbLoad(): rework code to save vertical space.	2013-03-12 19:46:33 +01:00
Salvatore Sanfilippo	9925c7c670	Merge pull request #1001 from djanowski/fatal-errors-rdb-load Abort when opening the RDB file results in an error other than ENOENT.	2013-03-12 11:40:36 -07:00
Damian Janowski	4178a80282	Abort when opening the RDB file results in an error other than ENOENT. This fixes cases where the RDB file does exist but can't be accessed for any reason. For instance, when the Redis process doesn't have enough permissions on the file.	2013-03-12 14:37:50 -03:00
antirez	215bfaea16	Set default for stop_writes_on_bgsave_err in initServerConfig(). It was placed for error in initServer() that's called after the configuation is already loaded, causing issue #1000.	2013-03-12 18:34:08 +01:00
antirez	91d3b487e7	redis-cli --bigkeys: don't crash with empty DBs.	2013-03-12 09:58:00 +01:00
antirez	2d851333a6	activeExpireCycle() smarter with many DBs and under expire pressure. activeExpireCycle() tries to test just a few DBs per iteration so that it scales if there are many configured DBs in the Redis instance. However this commit makes it a bit smarter when one a few of those DBs are under expiration pressure and there are many many keys to expire. What we do is to remember if in the last iteration had to return because we ran out of time. In that case the next iteration we'll test all the configured DBs so that we are sure we'll test again the DB under pressure. Before of this commit after some mass-expire in a given DB the function tested just a few of the next DBs, possibly empty, a few per iteration, so it took a long time for the function to reach again the DB under pressure. This resulted in a lot of memory being used by already expired keys and never accessed by clients.	2013-03-11 11:10:33 +01:00
antirez	08b107e405	In databasesCron() never test more DBs than we have.	2013-03-11 10:51:03 +01:00
antirez	4b1ccdfd49	Make comment name match var name in activeExpireCycle().	2013-03-11 10:42:14 +01:00
antirez	1f7d2c1e27	Optimize inner loop of activeExpireCycle() for no-expires case.	2013-03-09 11:48:54 +01:00
antirez	5f5aa487f9	REDIS_DBCRON_DBS_PER_SEC -> REDIS_DBCRON_DBS_PER_CALL	2013-03-09 11:44:20 +01:00
antirez	db29d71a30	activeExpireCycle(): process only a small number of DBs per iteration. This small number of DBs is set to 16 so actually in the default configuraiton Redis should behave exactly like in the past. However the difference is that when the user configures a very large number of DBs we don't do an O(N) operation, consuming a non trivial amount of CPU per serverCron() iteration.	2013-03-08 17:48:58 +01:00
antirez	40a2da159c	Use unsigned integers for DB ids, for defined wrap-to-zero.	2013-03-08 17:41:20 +01:00
antirez	7ac3b3a486	Only resize/rehash a few databases per cron iteration. This is the first step to lower the CPU usage when many databases are configured. The other is to also process a limited number of DBs per call in the active expire cycle.	2013-03-08 14:01:12 +01:00
antirez	dfd732dff3	Actually call databasesCron() inside serverCron().	2013-03-08 13:59:50 +01:00
antirez	cd9dcd1835	Move Redis databases background processing to databasesCron().	2013-03-08 12:34:05 +01:00
antirez	f0b807cd47	Cluster: update cluster state on PFAIL flag set/cleared on nodes.	2013-03-07 15:40:53 +01:00
antirez	299b8f76c2	Cluster: mark cluster state as fail of majority of masters is unreachable.	2013-03-07 15:36:59 +01:00
antirez	abf06fd5ff	Cluster: log global cluster state change.	2013-03-07 15:22:32 +01:00
antirez	3dad8196b7	Cluster: clusterUpdateState() function simplified. Also the NEEDHELP Cluster state was removed as it will no longer be used by Redis Cluster.	2013-03-06 18:25:40 +01:00
Gengliang Wang	042ed270c8	Removed useless "return" statements in pubsub.c (original commit message edited)	2013-03-06 16:49:20 +01:00
antirez	7b190a08cf	API to lookup commands with their original name. A new server.orig_commands table was added to the server structure, this contains a copy of the commant table unaffected by rename-command statements in redis.conf. A new API lookupCommandOrOriginal() was added that checks both tables, new first, old later, so that rewriteClientCommandVector() and friends can lookup commands with their new or original name in order to fix the client->cmd pointer when the argument vector is renamed. This fixes the segfault of issue #986, but does not fix a wider range of problems resulting from renaming commands that actually operate on data and are registered into the AOF file or propagated to slaves... That is command renaming should be handled with care.	2013-03-06 16:28:26 +01:00
antirez	bfa25441e7	Handle a non-impossible empty argv in loadServerConfigFromString(). Usually this does not happens since we trim for " \t\r\n", but if there are other chars that return true with isspace(), we may end with an empty argv. Better to handle the condition in an explicit way.	2013-03-06 12:40:48 +01:00
antirez	8c193af696	redis-cli: use sdsfreesplitres() instead of hand-coding it.	2013-03-06 12:38:32 +01:00
antirez	011fa89ac9	Cluster: sdssplitargs_free() -> sdsfreesplitres().	2013-03-06 12:38:06 +01:00
antirez	729a3432ba	sds.c: sdssplitargs_free() removed as it was a duplicate.	2013-03-06 12:38:06 +01:00
antirez	cf4d7737bb	More specific error message in loadServerConfigFromString().	2013-03-06 12:24:12 +01:00
antirez	4ea89e64c0	sdssplitargs(): on error set *argc to 0. This makes programs not checking the return value for NULL much safer since with this change: 1) It is still possible to iterate the zero-length result without crashes. 2) sdssplitargs_free will work against NULL and 0 count.	2013-03-06 12:21:31 +01:00
antirez	5cabae84e6	sdssplitargs(): now returns NULL only on error. An empty input string also resulted into the function returning NULL making it harder for the caller to distinguish between error and empty string without checking the original input string length.	2013-03-06 12:21:21 +01:00
charsyam	1303f02be6	Don't segfault on unbalanced quotes.	2013-03-06 11:54:02 +01:00
antirez	304ef5e283	Allow AUTH while loading the DB in memory. While Redis is loading the AOF or RDB file in memory only a subset of commands are allowed. This commit adds AUTH to this subset.	2013-03-06 11:50:38 +01:00
antirez	1025dd7786	Cluster: connect to our master ASAP after startup if we are a slave node.	2013-03-05 16:12:08 +01:00
antirez	bac57ad14b	Cluster: more robust FAIL flag cleaup. If we have a master in FAIL state that's reachable again, and apparently no one is going to serve its slots, clear the FAIL flag and let the cluster continue with its operations again.	2013-03-05 15:05:32 +01:00
antirez	1a02b7440a	Cluster: new node field fail_time. This is the unix time at which we set the FAIL flag for the node. It is only valid if FAIL is set. The idea is to use it in order to make the cluster more robust, for instance in order to revert a FAIL state if it is long-standing but still slots are assigned to this node, that is, no one is going to fix these slots apparently.	2013-03-05 13:15:05 +01:00
antirez	d3b4662347	Cluster: don't check keys hash slots when the source is our master. Usually we redirect clients to the right hash slot, however we don't want to do that with our master, we want just to mirror it.	2013-03-05 13:02:44 +01:00
antirez	31ac376051	Cluster: slaveof not allowed in redis.conf as well.	2013-03-05 12:58:22 +01:00
antirez	b7d085fc0d	Cluster: SLAVEOF command not allowed in cluster mode.	2013-03-05 12:39:41 +01:00
antirez	e4b481a5f6	Cluster: A comment updated in clusterCron().	2013-03-05 12:17:30 +01:00
antirez	d728ec6dee	Cluster: send a ping to every node we never contacted in timeout/2 seconds. Usually we try to send just 1 ping every second, however when we detect we are going to have unreliable failure detection because we can't ping some node in time, send an additional ping. This should only happen with very large clusters or when the the node timeout is set to a very low value.	2013-03-05 12:16:02 +01:00
antirez	e7628be2a7	Cluster: set node->slaveof correctly when a node state is updated.	2013-03-05 11:50:11 +01:00
antirez	d6457577d4	Cluster: don't perform startup slots sanity check for slaves. If we are a cluster node the DB content will not match our configured slots. Don't do the check at all.	2013-03-04 19:47:00 +01:00
antirez	d334897e80	Cluster: fix maximum line length when loading config. There are pathological cases where the line can be even longer a single node may contain all the slots in importing/migrating state.	2013-03-04 19:45:36 +01:00
antirez	3be893123f	Make sure replicationSetMaster() works when ip argument is not an sds.	2013-03-04 15:39:55 +01:00
antirez	b8a28bf442	Cluster: actually setup replication in CLUSTER REPLICATE.	2013-03-04 15:27:58 +01:00
antirez	7bead003e2	SLAVEOF command refactored into a proper API. We now have replicationSetMaster() and replicationUnsetMaster() that can be called in other contexts (for instance Redis Cluster).	2013-03-04 13:22:21 +01:00
antirez	0c01088b51	Cluster: REPLICATE subcommand and stub for clusterSetMaster().	2013-03-04 13:15:09 +01:00
charsyam	bc84c399f8	adding check error code adding check error code	2013-03-04 11:20:11 +01:00
antirez	3b3974410e	redis-cli: use keepalive socket option. This should improve things in two ways: 1) Prevent timeouts caused by the execution of long commands. 2) Improve detection of real connection errors. This is mostly effective only on Linux because of the bogus default keepalive settings. In Linux we have OS-specific calls to set the keepalive interval to reasonable values.	2013-03-04 11:14:32 +01:00
Salvatore Sanfilippo	174e51cb75	Merge pull request #967 from 0x20h/fix-git-diff suppress external diff program when using git diff.	2013-03-04 01:57:01 -08:00
antirez	caf9b24a7d	Cluster: don't set the slot as unassigned because of PONG info. As stated in the comment this is usually due to a resharding in progress so the client should be still redirected to the old node that will handle the redirection elsewhere.	2013-02-28 15:54:29 +01:00
antirez	0d77440b26	Cluster: better handling of slots changes in PONG packets. The new code makes sure that the node slots bitmap is always consistent with the cluster->slots array.	2013-02-28 15:41:54 +01:00
antirez	5f8fd27ace	Cluster: refactoring of clusterNode*Bit to use helper bitmap functions.	2013-02-28 15:23:09 +01:00
antirez	d21d6b666f	Cluster: use node->numslots instead of popcount() where possible.	2013-02-28 15:13:32 +01:00
antirez	4521115b17	Cluster: new field in cluster node structure, "numslots". Before a relatively slow popcount() operation was needed every time we needed to get the number of slots served by a given cluster node. Now we just need to check an integer that is taken in sync with the bitmap.	2013-02-28 15:11:05 +01:00
antirez	a2566d6618	Cluster: don't gossip about nodes that are not useful to the cluster.	2013-02-28 15:00:09 +01:00
antirez	bc922dc688	redis-trib: skip nodes without slots when creating the config signature.	2013-02-28 13:12:56 +01:00
antirez	64942fca01	redis-trib help.	2013-02-27 18:02:22 +01:00
antirez	d45d184118	Cluster: CLUSTER FORGET implemented.	2013-02-27 17:55:59 +01:00
antirez	d2b8281b3f	Cluster: added a missing return on CLUSTER SETSLOT.	2013-02-27 17:53:48 +01:00
antirez	7ddc0fe652	redis-trib: skip noaddr and disconnected nodes while loading cluster info.	2013-02-27 17:23:11 +01:00
antirez	d20dea3eb7	Cluster: blank node address when flagging it as NOADDR.	2013-02-27 17:09:33 +01:00
antirez	2dcb5ab72b	Cluster: add comments in sub-sections of CLUSTER command.	2013-02-27 16:12:59 +01:00
antirez	96dd210970	redis-trib: initial implementation of addnode command.	2013-02-27 15:58:41 +01:00
antirez	f7dac639a9	Remove warning when printing redisBuildId().	2013-02-27 12:33:27 +01:00
antirez	e4682c82e3	Remove too agressive/spamming log in rdb.c.	2013-02-27 12:29:20 +01:00
antirez	f9b5ca29fd	Use GCC printf format attribute for redisLog(). This commit also fixes redisLog() statements producing warnings.	2013-02-27 12:27:15 +01:00
antirez	c35b065a64	Better panic message for failed time event creation.	2013-02-27 12:00:11 +01:00
Stam He	e431a97660	add a check for aeCreateTimeEvent 1) Add a check for aeCreateTimeEvent in function initServer.	2013-02-27 11:57:35 +01:00
Stam He	9c8be6cab9	Set proctitle: avoid the use of __attribute__((constructor)). This cased a segfault in some Linux system and was GCC-specific. Commit modified by @antirez: 1) Stripped away the part to set the proc title via config for now. 2) Handle initialization of setproctitle only when the replacement is used. 3) Don't require GCC now that the attribute constructor is no longer used.	2013-02-27 11:50:35 +01:00
antirez	deb1f4d841	setproctitle.c: declar tmp as static so valgrind will not detect a leak.	2013-02-26 15:51:33 +01:00
antirez	d0992d6e8b	Cluster: a few random fixes to the new failure detection.	2013-02-26 15:15:44 +01:00
antirez	f288b07563	Cluster: log the event when we clear the FAIL flag.	2013-02-26 15:03:38 +01:00
antirez	97ffcd351b	Cluster: use the failure report API to reimplement failure detection. The new system detects a failure only when there is quorum from masters.	2013-02-26 14:58:39 +01:00
antirez	6356cf6808	Set process name in ps output to make operations safer. This commit allows Redis to set a process name that includes the binding address and the port number in order to make operations simpler. Redis children processes doing AOF rewrites or RDB saving change the name into redis-aof-rewrite and redis-rdb-bgsave respectively. This in general makes harder to kill the wrong process because of an error and makes simpler to identify saving children. This feature was suggested by Arnaud GRANAL in the Redis Google Group, Arnaud also pointed me to the setproctitle.c implementation includeed in this commit. This feature should work on all the Linux, OSX, and all the three major BSD systems.	2013-02-26 11:52:12 +01:00
antirez	1b1b3f6c06	Cluster: invert two functions declarations in more natural order.	2013-02-26 11:19:48 +01:00
antirez	d5e8b0a47f	Cluster: cleanup idle failure reports every time we remove one. This is not very important as anyway when the function counting the number of reports is called the cleanup is performed. However with this change if only part of the nodes that reported the failure will report the node is back ok, we'll cleanup the older entries ASAP. In complex split net split scenarios, and when we are dealing with clusters having nodes in the order of ~ 1000, this can save some CPU.	2013-02-26 11:15:18 +01:00
antirez	9cb578ced0	Cluster: new function clusterNodeDelFailureReport() for failure reports. This is the missing part of the API that will be used to reimplement failure detection of Cluster nodes.	2013-02-25 19:13:22 +01:00
Arnaud Granal	3d588b37ce	Fix error "repl-backlog-size must be 1 or greater" The parameter repl-backlog-size is not parsed correctly in the configuration file. argv[0] is parsed instead of argv[1].	2013-02-25 19:25:48 +02:00
antirez	18f537083a	Cluster: no limits for the count parameter of CLUSTER GETKEYSINSLOT. Not sure why I set a limit to 1 million keys, there is no reason for this artificial limit, and anyway this is s a stupid limit because it is already high enough to create latency issues. So let's the users shoot on their feet because maybe they just actually know what they are doing.	2013-02-25 12:41:13 +01:00
antirez	544bbe5387	Cluster: validate slot number in CLUSTER COUNTKEYSINSLOT.	2013-02-25 12:40:32 +01:00
antirez	996a643752	Cluster: use O(log(N)) algo for countKeysInSlot().	2013-02-25 12:37:50 +01:00
antirez	d4fa40655d	Cluster: new sub-command CLUSTER COUNTKEYSINSLOT. The new sub-command uses the new countKeysInSlot() API and allows a cluster client to get the number of keys for a given hashslot.	2013-02-25 12:04:31 +01:00
antirez	a517c89321	Cluster: verifyClusterConfigWithData() implemented.	2013-02-25 11:43:49 +01:00
antirez	9947b0d96d	A comment in main() clarified.	2013-02-25 11:40:21 +01:00
antirez	d2154254be	Cluster: fix case for getKeysInSlot() and countKeysInSlot(). Redis functions start in low case. A few functions about cluster were capitalized the wrong way.	2013-02-25 11:25:40 +01:00
antirez	c2eb4a606f	Cluster: use CountKeysInSlot() when we just need the count.	2013-02-25 11:23:04 +01:00
antirez	ad3bca1fdf	Cluster: added stub for verifyClusterConfigWithData(). See the top-comment for the function in this commit for details about what the function is supposed to do.	2013-02-25 11:20:17 +01:00
antirez	1abce14611	Cluster: added new API countKeysInSlot(). This is similar to getKeysInSlot() but just returns the number of keys found in a given hash slot, without returning the keys.	2013-02-25 11:15:03 +01:00
0x20h	977035b7b5	suppress external diff program when using git diff.	2013-02-24 18:17:46 +01:00
antirez	825e07f2fd	Cluster: if no previous config exists, create the myself node as master.	2013-02-22 19:24:01 +01:00
antirez	f4093753e4	Cluster: add cluster_size field in CLUSTER INFO output.	2013-02-22 19:20:38 +01:00
antirez	d218a4e244	Cluster: new state information, cluster size. The definition of cluster size is: the number of known nodes in the cluster that are masters and serving at least an hash slot.	2013-02-22 19:18:30 +01:00
antirez	5c55ed9388	Cluster: remove warning adding clusterNodeSetSlotBit() prototype.	2013-02-22 17:45:49 +01:00
antirez	974929770b	Cluster: introduced a failure reports system. A §Redis Cluster node used to mark a node as failing when itself detected a failure for that node, and a single acknowledge was received about the possible failure state. The new API will be used in order to possible to require that N other nodes have a PFAIL or FAIL state for a given node for a node to set it as failing.	2013-02-22 17:43:35 +01:00
antirez	36af851550	redis-trib: check that all the nodes agree about the slots configuration.	2013-02-22 12:25:16 +01:00
antirez	51b5058d04	redis-trib: skeleton of coverage fix for "keys in multiple nodes" case.	2013-02-22 11:33:10 +01:00
antirez	a81c598f95	redis-trib: handle slot coverage fix in the "no nodes with keys" case.	2013-02-22 10:23:53 +01:00
antirez	392e0fa7eb	Cluster: fix case of slotToKey...() functions.	2013-02-22 10:16:21 +01:00
antirez	d04770988d	Cluster: empty the internal sorted set mapping keys to slots on FLUSHDB/ALL.	2013-02-22 10:15:32 +01:00
antirez	efe51dfff5	redis-trib: specify single node address when fixing coverage.	2013-02-22 10:05:07 +01:00
antirez	915e81335e	redis-trib: ability to fix uncovered slots for the trivial case.	2013-02-21 18:10:06 +01:00
antirez	619d3945f8	redis-trib: fixed typo in method name.	2013-02-21 16:58:27 +01:00
antirez	07b6322735	Cluster: more correct update of slots state when PONG is received.	2013-02-21 16:52:06 +01:00
antirez	c6da9d9fac	Call clusterUpdateState() after CLUSTER SETSLOT too.	2013-02-21 16:31:22 +01:00
antirez	3a99d1228a	Aesthetic change to make a line more 80-cols friendly.	2013-02-21 16:24:48 +01:00
antirez	5fd9f701da	redis-trib: move instance vars in the right class.	2013-02-21 13:06:59 +01:00
antirez	7898bf4b7e	redis-trib: some refactoring and skeleton of the "fix" command.	2013-02-21 13:00:41 +01:00
antirez	dc4af60628	Cluster: clusterAddSlot() was not doing what stated in the comment.	2013-02-21 11:51:17 +01:00
antirez	fdb57233e2	Cluster: always use cluster(Add\|Del)Slot to modify the cluster slots table.	2013-02-21 11:44:58 +01:00
antirez	786b8d6c87	Use RESTORE-ASKING for MIGRATE when in cluster mode.	2013-02-20 17:36:54 +01:00
antirez	ea7fc82a4a	Cluster: new command flag forcing implicit ASKING. Also using this new flag the RESTORE-ASKING command was implemented that will be used by MIGRATE.	2013-02-20 17:28:35 +01:00
antirez	9ec1b709f5	Cluster: ASKING command fixed, state was not retained.	2013-02-20 17:07:52 +01:00
antirez	b8d8b9ec41	redis-trib: set the migrating slot in the correct way when resharding.	2013-02-20 15:29:53 +01:00
antirez	9a04e12cc0	Cluster: I/O errors are now logged at DEBUG level.	2013-02-20 13:18:51 +01:00
antirez	917dd53216	redis-trib: make a few comments 80-cols friendly.	2013-02-15 17:11:55 +01:00
antirez	455da35c7f	Cluster: specific error code for cluster down condition.	2013-02-15 16:53:24 +01:00
antirez	522c3255db	Merge branch 'unstable' of github.com:antirez/redis into unstable	2013-02-15 16:45:04 +01:00
antirez	02796ba7a7	Cluster: sanity checks on the cluster bus message length.	2013-02-15 16:44:39 +01:00
antirez	8853698a6f	Removed useless newlines from hashTypeCurrentObject().	2013-02-15 13:12:55 +01:00
antirez	6b9c661838	Cluster: make valgrind happy initializing all the bytes of the node IP.	2013-02-15 12:58:35 +01:00
antirez	7371d5e248	Remove wrong decrRefCount() from getNodeByQuery(). This fixes issue #607.	2013-02-15 11:57:53 +01:00
antirez	20f52b5b78	Top comment for getNodeByQuery() improved.	2013-02-15 11:50:54 +01:00
antirez	6b641f3aeb	redis-cli: update prompt on cluster redirection.	2013-02-14 18:49:08 +01:00
antirez	e0e15bd06d	Cluster: with 16384 slots we need bigger buffers.	2013-02-14 15:36:33 +01:00
antirez	1a32d99b28	Cluster: move cluster config file out of config state. This makes us able to avoid allocating the cluster state structure if cluster is not enabled, but still we can handle the configuration directive that sets the cluster config filename.	2013-02-14 15:20:02 +01:00
antirez	1649e509c3	Cluster: the cluster state structure is now heap allocated.	2013-02-14 13:20:56 +01:00
antirez	9dfd11c3da	Cluster: Initialize ip and port in createClusterNode().	2013-02-14 13:01:28 +01:00
antirez	a26690e8b5	Cluster: redis-trib updated to use 16384 hash slots.	2013-02-14 12:55:34 +01:00
antirez	ebd666db47	Cluster: from 4096 to 16384 hash slots.	2013-02-14 12:49:16 +01:00
antirez	072c91fe13	PSYNC: another change to unexpected reply from PSYNC.	2013-02-13 18:43:40 +01:00
antirez	0e1be5347b	PSYNC: More robust handling of unexpected reply to PSYNC.	2013-02-13 18:33:33 +01:00
antirez	7404b95833	Avoid compiler warning by casting to match printf() specifier.	2013-02-13 13:38:20 +01:00
antirez	3419c8ce70	Replication: more strict error checking for master PING reply.	2013-02-12 16:53:27 +01:00
antirez	dc24a6b132	Return a specific NOAUTH error if authentication is required.	2013-02-12 16:25:41 +01:00
antirez	24f258360b	Replication: added new stats counting full and partial resynchronizations.	2013-02-12 15:33:54 +01:00
antirez	04bdb3a2a4	Add missing bracket removed for error after rebase of PSYNC.	2013-02-12 12:56:32 +01:00
antirez	3af478e9ef	PSYNC: debugging printf() calls are now logs at DEBUG level.	2013-02-12 12:52:22 +01:00
antirez	89b48f0825	Remove harmless warning in slaveTryPartialResynchronization().	2013-02-12 12:52:21 +01:00
antirez	0ed6daa48b	PSYNC: don't use the client buffer to send +CONTINUE and +FULLRESYNC. When we are preparing an handshake with the slave we can't touch the connection buffer as it'll be used to accumulate differences between the sent RDB file and what arrives next from clients. So in short we can't use addReply() family functions. However we just use write(2) because we know that the socket buffer is empty, since a prerequisite for SYNC to work is that the static buffer and the output list are empty, and in general it is not expected that a client SYNCs after doing some heavy I/O with the master. However a short write connection is explicitly handled to avoid fragility (we simply close the connection and the slave will retry).	2013-02-12 12:52:21 +01:00
antirez	d2a0348a49	SYNC not allowed with pending data on the static output buffer.	2013-02-12 12:52:21 +01:00
antirez	da315d3325	Log the unexpected string received in place of the SYNC payload length.	2013-02-12 12:52:21 +01:00
antirez	41d64a7516	After SLAVEOF <newslave> don't allow chained slaves to PSYNC.	2013-02-12 12:52:21 +01:00
antirez	078882025e	PSYNC: work in progress, preview #2 , rebased to unstable.	2013-02-12 12:52:21 +01:00
antirez	e34a35a511	Use the new unified protocol to send SELECT to slaves. SELECT was still transmitted to slaves using the inline protocol, that is conceived mostly for humans to type into telnet sessions, and is notably not understood by redis-cli --slave. Now the new protocol is used instead.	2013-02-12 12:50:28 +01:00
antirez	4b83ad4e1f	Use replicationFeedSlaves() to send PING to slaves. A Redis master sends PING commands to slaves from time to time: doing this ensures that even if absence of writes, the master->slave channel remains active and the slave can feel the master presence, instead of closing the connection for timeout. This commit changes the way PINGs are sent to slaves in order to use the standard interface used to replicate all the other commands, that is, the function replicationFeedSlaves(). With this change the stream of commands sent to every slave is exactly the same regardless of their exact state (Transferring RDB for first synchronization or slave already online). With the previous implementation the PING was only sent to online slaves, with the result that the output stream from master to slaves was not identical for all the slaves: this is a problem if we want to implement partial resyncs in the future using a global replication stream offset. TL;DR: this commit should not change the behaviour in practical terms, but is just something in preparation for partial resynchronization support.	2013-02-12 12:50:28 +01:00
antirez	7465ac7ab1	Emit SELECT to slaves in a centralized way. Before this commit every Redis slave had its own selected database ID state. This was not actually useful as the emitted stream of commands is identical for all the slaves. Now the the currently selected database is a global state that is set to -1 when a new slave is attached, in order to force the SELECT command to be re-emitted for all the slaves. This change is useful in order to implement replication partial resynchronization in the future, as makes sure that the stream of commands received by slaves, including SELECT commands, are exactly the same for every slave connected, at any time. In this way we could have a global offset that can identify a specific piece of the master -> slaves stream of commands.	2013-02-12 12:50:28 +01:00
antirez	1a27d41156	Makefile: valgrind target added (forces -O0 and libc malloc).	2013-02-11 12:11:28 +01:00
antirez	76a5b21c3a	Tcp keep-alive: send three probes before detectin an error. Otherwise we end with less reliable connections because it's too easy that a single packet gets lost.	2013-02-08 17:06:01 +01:00
antirez	124a635bc5	Set SO_KEEPALIVE on client sockets if configured to do so.	2013-02-08 16:40:59 +01:00
antirez	ee21c18e5d	Add SO_KEEPALIVE support to anet.c.	2013-02-08 16:30:26 +01:00
antirez	a6c2f9012f	Make all WATCHers dirty when the slave reloads the DB.	2013-02-08 10:26:19 +01:00
antirez	46dd4c62b3	LASTSAVE is a "random" command.	2013-02-07 19:13:00 +01:00
antirez	b70b459b0e	TCP_NODELAY after SYNC: changes to the implementation.	2013-02-05 12:04:30 +01:00
charsyam	c85647f354	Turn off TCP_NODELAY on the slave socket after SYNC. Further details from @antirez: It was reported by @StopForumSpam on Twitter that the Redis replication link was strangely using multiple TCP packets for multiple commands. This wastes a lot of bandwidth and is due to the TCP_NODELAY option we enable on the socket after accepting a new connection. However the master -> slave channel is a one-way channel since Redis replication is asynchronous, so there is no point in trying to reduce the latency, we should aim to reduce the bandwidth. For this reason this commit introduces the ability to disable the nagle algorithm on the socket after a successful SYNC. This feature is off by default because the delay can be up to 40 milliseconds with normally configured Linux kernels.	2013-02-05 12:04:25 +01:00
Rock Li	8063155cd0	retval doesn't initalized If each if conditions are all fail, variable retval will under uninitlized	2013-02-05 15:56:04 +08:00
Gengliang Wang	002747336a	Fix a bug in srandmemberWithCountCommand() In CASE 2, the call sunionDiffGenericCommand will involve the string "srandmember" > sadd foo one (integer 1) > sadd srandmember two (integer 2) > srandmember foo 3 1)"one" 2)"two"	2013-02-04 14:01:08 +08:00
antirez	089cbe643f	Sentinel: advertise the promoted slave address only after successful setup.	2013-01-31 17:19:21 +01:00
Salvatore Sanfilippo	aca005c246	Merge pull request #914 from catwell/unstable fix comments forgotten in #285 (zipmap -> ziplist)	2013-01-31 03:37:48 -08:00
antirez	ad297a1a67	Z*STORE event fixed: generate del only if resulting sorted set is empty.	2013-01-29 13:50:01 +01:00
antirez	e41d1d77e3	Generate del events when S*STORE commands delete the destination key.	2013-01-29 13:43:13 +01:00
antirez	4dfb5752e0	Send 'expired' events when a key expires by lookup.	2013-01-28 13:15:19 +01:00
antirez	562b2bd6a7	Keyspace notifications: fixed a leak and a bug introduced in the latest commit.	2013-01-28 13:15:16 +01:00
antirez	fce016d31b	Keyspace events: it is now possible to select subclasses of events. When keyspace events are enabled, the overhead is not sever but noticeable, so this commit introduces the ability to select subclasses of events in order to avoid to generate events the user is not interested in. The events can be selected using redis.conf or CONFIG SET / GET.	2013-01-28 13:15:12 +01:00
antirez	1c0c551776	decrRefCount -> decrRefCountVoid in list constructor.	2013-01-28 13:15:08 +01:00
antirez	da04e6ed44	Keyspace events added for more commands.	2013-01-28 13:14:56 +01:00
antirez	8766e81079	Fix decrRefCount() prototype from void to robj pointer. decrRefCount used to get its argument as a void* pointer in order to be used as destructor where a 'void free_object(void)' prototype is expected. However this made simpler to introduce bugs by freeing the wrong pointer. This commit fixes the argument type and introduces a new wrapper called decrRefCountVoid() that can be used when the void argument is needed.	2013-01-28 13:14:53 +01:00
antirez	2d20e68fe4	notifyKeyspaceEvent(): release channel names using the right pointers.	2013-01-28 13:14:49 +01:00
antirez	5b9357a6b3	Initial test events for the new keyspace notification API.	2013-01-28 13:14:46 +01:00
antirez	2ea9518a53	Fixed over-80-cols comment in db.c	2013-01-28 13:14:42 +01:00
antirez	08d8bb57df	Two fixes to initial keyspace notifications API.	2013-01-28 13:14:39 +01:00
antirez	4cdbce341e	Keyspace events notification API.	2013-01-28 13:14:36 +01:00
Pierre Chapuis	50d43a9823	fix comments forgotten in #285 (zipmap -> ziplist)	2013-01-28 11:07:17 +01:00
antirez	88015b89a0	redis-cli --bigkeys output is now simpler to understand.	2013-01-21 19:15:58 +01:00
antirez	2039f1a38a	UNSUBSCRIBE and PUNSUBSCRIBE: always provide a reply. UNSUBSCRIBE and PUNSUBSCRIBE commands are designed to mass-unsubscribe the client respectively all the channels and patters if called without arguments. However when these functions are called without arguments, but there are no channels or patters we are subscribed to, the old behavior was to don't reply at all. This behavior is broken, as every command should always reply. Also it is possible that we are no longer subscribed to a channels but we are subscribed to patters or the other way around, and the client should be notified with the correct number of subscriptions. Also it is not pretty that sometimes we did not receive a reply at all in a redis-cli session from these commands, blocking redis-cli trying to read the reply. This fixes issue #714.	2013-01-21 19:02:26 +01:00
antirez	93f61bb2a4	Fixed a bug in memtest progress bar, that had no actual effects. This closes issue #859, thanks to @erbenmo.	2013-01-21 12:34:22 +01:00
Salvatore Sanfilippo	442235fe40	Merge pull request #869 from bilalhusain/patch-2 s/adiacent/adjacent/	2013-01-21 03:19:02 -08:00
antirez	cd892d015d	Not every __sun has backtrace(). I don't know how to test for Open Solaris that has support for backtrace() so for now removing the #ifdef that breaks compilation under other Solaris flavors.	2013-01-21 12:06:54 +01:00
antirez	e50cdbe461	Additionally two typos fixed thanks to @jodal	2013-01-19 13:46:14 +01:00
antirez	79a0ef62db	Whitelist SIGUSR1 to avoid auto-triggering errors. This commit fixes issue #875 that was caused by the following events: 1) There is an active child doing BGSAVE. 2) flushall is called (or any other condition that makes Redis killing the saving child process). 3) An error is sensed by Redis as the child exited with an error (killed by a singal), that stops accepting write commands until a BGSAVE happens to be executed with success. Whitelisting SIGUSR1 and making sure Redis always uses this signal in order to kill its own children fixes the issue.	2013-01-19 13:30:38 +01:00
antirez	ab247fc176	Clear server.shutdown_asap on failed shutdown. When a SIGTERM is received Redis schedules a shutdown. However if it fails to perform the shutdown it must be clear the shutdown_asap flag otehrwise it will try again and again possibly making the server unusable.	2013-01-19 13:19:41 +01:00
antirez	08d200baeb	Slowlog: don't log EXEC but just the executed commands. The Redis Slow Log always used to log the slow commands executed inside a MULTI/EXEC block. However also EXEC was logged at the end, which is perfectly useless. Now EXEC is no longer logged and a test was added to test this behavior. This fixes issue #759.	2013-01-19 12:53:21 +01:00
guiquanz	9d09ce3981	Fixed many typos.	2013-01-19 10:59:44 +01:00
Salvatore Sanfilippo	61dfc2e521	Merge pull request #887 from charsyam/redis-cli-prompt redis-cli prompt bug fix	2013-01-19 01:32:28 -08:00
Salvatore Sanfilippo	74f137308e	Merge pull request #895 from badboy/catch_con_error redis-cli: always exit if connection fails.	2013-01-19 01:27:56 -08:00
bitterb	c2dc172a9d	Fix an error reply for CLIENT command	2013-01-19 14:11:33 +09:00
Jan-Erik Rediger	59046a7373	Always exit if connection fails. This avoids unnecessary core dumps. Fixes antirez/redis#894	2013-01-18 10:13:10 +01:00
Nathan Parry	1920cab3bc	redis-cli --rdb fails if server sends a ping Redis pings slaves in "pre-synchronization stage" with newlines. (See https://github.com/antirez/redis/blob/2.6.9/src/replication.c#L814) However, redis-cli does not expect this - it sees the newline as the end of the bulk length line, and ends up returning 0 as bulk the length. This manifests as the following when running redis-cli: $ ./src/redis-cli --rdb some_file SYNC sent to master, writing 0 bytes to 'some_file' Transfer finished with success. With this commit, we just ignore leading newlines while reading the bulk length line. To reproduce the problem, load enough data into Redis so that the preparation of the RDB snapshot takes long enough for a ping to occur while redis-cli is waiting for the data.	2013-01-18 00:10:58 -05:00
charsyam	e396236651	redis-cli prompt bug fix	2013-01-16 17:20:54 -08:00
antirez	a0c24821e2	redis-cli: save an RDB dump from remote server to local file.	2013-01-16 19:44:37 +01:00
antirez	9b89ab06c4	Typo fixed, ASCI -> ASCII.	2013-01-15 13:34:13 +01:00
antirez	1971740f0c	CLIENT GETNAME and CLIENT SETNAME introduced. Sometimes it is much simpler to debug complex Redis installations if it is possible to assign clients a name that is displayed in the CLIENT LIST output. This is the case, for example, for "leaked" connections. The ability to provide a name to the client makes it quite trivial to understand what is the part of the code implementing the client not releasing the resources appropriately. Behavior: CLIENT SETNAME: set a name for the client, or remove the current name if an empty name is set. CLIENT GETNAME: get the current name, or a nil. CLIENT LIST: now displays the client name if any. Thanks to Mark Gravell for pushing this idea forward.	2013-01-15 13:34:10 +01:00
antirez	ef99e146a8	Undo slave-master handshake when SLAVEOF sets a new slave. Issue #828 shows how Redis was not correctly undoing a non-blocking connection attempt with the previous master when the master was set to a new address using the SLAVEOF command. This was also a result of lack of refactoring, so now there is a function to cancel the non blocking handshake with the master. The new function is now used when SLAVEOF NO ONE is called or when SLAVEOF is used to set the master to a different address.	2013-01-15 13:33:24 +01:00
antirez	a33869c330	Makefile.dep updated.	2013-01-11 23:50:32 +01:00
antirez	f5fa6824db	Comment in the call() function clarified a bit.	2013-01-10 11:19:40 +01:00
antirez	baee565029	Multiple fixes for EVAL (issue #872 ). 1) The event handler was no restored after a timeout condition if the command was eventually executed with success. 2) The command was not converted to EVAL in case of errors in the middle of the execution. 3) Terrible duplication of code without any apparent reason.	2013-01-10 10:46:05 +01:00
Bilal Husain	717e5ffb45	s/adiacent/adjacent/ fixed typo in a comment (step 2 memcheck)	2013-01-09 21:46:58 +05:30
antirez	d7740fc8f3	Better error reporting when fd event creation fails.	2013-01-03 14:29:34 +01:00
antirez	e9261b1e2d	ae.c: set errno when error is not a failing syscall. In this way the caller is able to perform better error checking or to use strerror() without the risk of meaningless error messages being displayed.	2013-01-03 14:29:20 +01:00
antirez	4dc1e0dd30	Fix overflow in mstime() in redis-cli and benchmark. The problem does not exist in the Redis server implementation of mstime() but is only limited to redis-cli and redis-benchmark. Thix fixes issue #839.	2012-12-20 15:20:55 +01:00
antirez	f1481d4a03	serverCron() frequency is now a runtime parameter (was REDIS_HZ). REDIS_HZ is the frequency our serverCron() function is called with. A more frequent call to this function results into less latency when the server is trying to handle very expansive background operations like mass expires of a lot of keys at the same time. Redis 2.4 used to have an HZ of 10. This was good enough with almost every setup, but the incremental key expiration algorithm was working a bit better under extreme pressure when HZ was set to 100 for Redis 2.6. However for most users a latency spike of 30 milliseconds when million of keys are expiring at the same time is acceptable, on the other hand a default HZ of 100 in Redis 2.6 was causing idle instances to use some CPU time compared to Redis 2.4. The CPU usage was in the order of 0.3% for an idle instance, however this is a shame as more energy is consumed by the server, if not important resources. This commit introduces HZ as a runtime parameter, that can be queried by INFO or CONFIG GET, and can be modified with CONFIG SET. At the same time the default frequency is set back to 10. In this way we default to a sane value of 10, but allows users to easily switch to values up to 500 for near real-time applications if needed and if they are willing to pay this small CPU usage penalty.	2012-12-14 17:10:40 +01:00
Salvatore Sanfilippo	a4d68dc541	Merge pull request #824 from ptjm/unstable Define _XOPEN_SOURCE appropriately on NetBSD.	2012-12-12 09:34:41 -08:00
Patrick TJ McPhee	289942b625	Define _XOPEN_SOURCE appropriately on NetBSD.	2012-12-12 10:49:12 -05:00
antirez	705874e31d	Fix config.h endianess detection to work on Linux / PPC64. Config.h performs endianess detection including OS-specific headers to define the endianess macros, or when this is not possible, checking the processor type via ifdefs. Sometimes when the OS-specific macro is included, only __BYTE_ORDER is defined, while BYTE_ORDER remains undefined. There is code at the end of config.h endianess detection in order to define the macros without the underscore, but it was not working correctly. This commit fixes endianess detection fixing Redis on Linux / PPC64 and possibly other systems.	2012-12-11 17:01:00 +01:00
antirez	ab2924cff3	Memory leak fixed: release client's bpop->keys dictionary. Refactoring performed after issue #801 resolution (see commit `2f87cf8b01`) introduced a memory leak that is fixed by this commit. I simply forgot to free the new allocated dictionary in the client structure trusting the output of "make test" on OSX. However due to changes in the "leaks" utility the test was no longer testing memory leaks. This problem was also fixed. Fortunately the CI test running at ci.redis.io spotted the bug in the valgrind run. The leak never ended into a stable release.	2012-12-03 12:12:53 +01:00
antirez	2f87cf8b01	Blocking POP: use a dictionary to store keys clinet side. To store the keys we block for during a blocking pop operation, in the case the client is blocked for more data to arrive, we used a simple linear array of redis objects, in the blockingState structure: robj *keys; int count; However in order to fix issue #801 we also use a dictionary in order to avoid to end in the blocked clients queue for the same key multiple times with the same client. The dictionary was only temporary, just to avoid duplicates, but since we create / destroy it there is no point in doing this duplicated work, so this commit simply use a dictionary as the main structure to store the keys we are blocked for. So instead of the previous fields we now just have: dict keys; This simplifies the code and reduces the work done by the server during a blocking POP operation.	2012-12-02 20:43:15 +01:00
antirez	4e6dd7bc86	Client should not block multiple times on the same key. Sending a command like: BLPOP foo foo foo foo 0 Resulted into a crash before this commit since the client ended being inserted in the waiting list for this key multiple times. This resulted into the function handleClientsBlockedOnLists() to fail because we have code like that: if (de) { list clients = dictGetVal(de); int numclients = listLength(clients); while(numclients--) { listNode clientnode = listFirst(clients); /* server clients here... */ } } The code to serve clients used to remove the served client from the waiting list, so if a client is blocking multiple times, eventually the call to listFirst() will return NULL or worse will access random memory since the list may no longer exist as it is removed by the function unblockClientWaitingData() if there are no more clients waiting for this list. To avoid making the rest of the implementation more complex, this commit modifies blockForKeys() so that a client will be put just a single time into the waiting list for a given key. Since it is Saturday, I hope this fixes issue #801.	2012-12-02 20:43:07 +01:00
antirez	f50e658455	SDIFF is now able to select between two algorithms for speed. SDIFF used an algorithm that was O(N) where N is the total number of elements of all the sets involved in the operation. The algorithm worked like that: ALGORITHM 1: 1) For the first set, add all the members to an auxiliary set. 2) For all the other sets, remove all the members of the set from the auxiliary set. So it is an O(N) algorithm where N is the total number of elements in all the sets involved in the diff operation. Cristobal Viedma suggested to modify the algorithm to the following: ALGORITHM 2: 1) Iterate all the elements of the first set. 2) For every element, check if the element also exists in all the other remaining sets. 3) Add the element to the auxiliary set only if it does not exist in any of the other sets. The complexity of this algorithm on the worst case is O(N*M) where N is the size of the first set and M the total number of sets involved in the operation. However when there are elements in common, with this algorithm we stop the computation for a given element as long as we find a duplicated element into another set. I (antirez) added an additional step to algorithm 2 to make it faster, that is to sort the set to subtract from the biggest to the smallest, so that it is more likely to find a duplicate in a larger sets that are checked before the smaller ones. WHAT IS BETTER? None of course, for instance if the first set is much larger than the other sets the second algorithm does a lot more work compared to the first algorithm. Similarly if the first set is much smaller than the other sets, the original algorithm will less work. So this commit makes Redis able to guess the number of operations required by each algorithm, and select the best at runtime according to the input received. However, since the second algorithm has better constant times and can do less work if there are duplicated elements, an advantage is given to the second algorithm.	2012-11-30 16:36:42 +01:00
antirez	b4abbaf755	redis-benchmark: seed the PRNG with time() at startup.	2012-11-30 15:41:09 +01:00
antirez	2f62c9663c	Introduced the Build ID in INFO and --version output. The idea is to be able to identify a build in a unique way, so for instance after a bug report we can recognize that the build is the one of a popular Linux distribution and perform the debugging in the same environment.	2012-11-29 14:20:08 +01:00
antirez	b1b602a928	On crash memory test rewrote so that it actaully works. 1) We no longer test location by location, otherwise the CPU write cache completely makes our business useless. 2) We still need a memory test that operates in steps from the first to the last location in order to never hit the cache, but that is still able to retain the memory content. This was tested using a Linux box containing a bad memory module with a zingle bit error (always zero). So the final solution does has an error propagation step that is: 1) Invert bits at every location. 2) Swap adiacent locations. 3) Swap adiacent locations again. 4) Invert bits at every location. 5) Swap adiacent locations. 6) Swap adiacent locations again. Before and after these steps, and after step 4, a CRC64 checksum is computed. If the three CRC64 checksums don't match, a memory error was detected.	2012-11-29 10:24:35 +01:00
charsyam	dee0b939fc	Remove unnecessary condition in _dictExpandIfNeeded (dict.c)	2012-11-28 11:44:39 +01:00
antirez	c87a40897c	Merge remote-tracking branch 'origin/unstable' into unstable	2012-11-28 11:41:27 +01:00
Matt Arsenault	504e5072eb	It's a watchdog, not a watchdong.	2012-11-28 11:35:19 +01:00
charsyam	d7c7ac4a57	remove compile warning bioKillThreads	2012-11-23 05:52:39 +08:00
antirez	95f68f7b0f	EVALSHA is now case insensitive. EVALSHA used to crash if the SHA1 was not lowercase (Issue #783). Fixed using a case insensitive dictionary type for the sha -> script map used for replication of scripts.	2012-11-22 15:50:00 +01:00
antirez	cceb0c5b4a	Fix integer overflow in zunionInterGenericCommand(). This fixes issue #761.	2012-11-22 15:28:28 +01:00
antirez	3d1391272a	Safer handling of MULTI/EXEC on errors. After the transcation starts with a MULIT, the previous behavior was to return an error on problems such as maxmemory limit reached. But still to execute the transaction with the subset of queued commands on EXEC. While it is true that the client was able to check for errors distinguish QUEUED by an error reply, MULTI/EXEC in most client implementations uses pipelining for speed, so all the commands and EXEC are sent without caring about replies. With this change: 1) EXEC fails if at least one command was not queued because of an error. The EXECABORT error is used. 2) A generic error is always reported on EXEC. 3) The client DISCARDs the MULTI state after a failed EXEC, otherwise pipelining multiple transactions would be basically impossible: After a failed EXEC the next transaction would be simply queued as the tail of the previous transaction.	2012-11-22 10:32:07 +01:00
antirez	7536991726	Make bio.c threads killable ASAP if needed. We use this new bio.c feature in order to stop our I/O threads if there is a memory test to do on crash. In this case we don't want anything else than the main thread to run, otherwise the other threads may mess with the heap and the memory test will report a false positive.	2012-11-22 10:12:11 +01:00
antirez	5a9e3f5842	Fast memory test on Redis crash.	2012-11-21 13:24:44 +01:00
antirez	3cb432837c	Use more fine grained HAVE macros instead of HAVE_PROCFS.	2012-11-21 13:17:38 +01:00
charsyam	52b52a3508	fix randstring bug	2012-11-20 02:50:31 +08:00
antirez	49b6452351	Children creating AOF or RDB files now report memory used by COW. Finally Redis is able to report the amount of memory used by copy-on-write while saving an RDB or writing an AOF file in background. Note that this information is currently only logged (at NOTICE level) and not shown in INFO because this is less trivial (but surely doable with some minor form of interprocess communication). The reason we can't capture this information on the parent before we call wait3() is that the Linux kernel will release the child memory ASAP, and only retain the minimal state for the process that is useful to report the child termination to the parent. The COW size is obtained by summing all the Private_Dirty fields found in the "smap" file inside the proc filesystem for the process. All this is Linux specific and is not available on other systems.	2012-11-19 12:02:08 +01:00
antirez	3bfeb9c1a7	zmalloc_get_private_dirty() function added (Linux only). For non Linux systmes it just returns 0. This function is useful to estimate copy-on-write because of childs saving stuff on disk.	2012-11-19 11:47:35 +01:00
antirez	af0b220756	zmalloc: kill unused __size parameter in update_zmalloc_stat_alloc() macro.	2012-11-14 12:52:38 +01:00
antirez	a779b7e901	Merge branch 'migrate-cache' into unstable	2012-11-14 12:21:23 +01:00
antirez	2feef47aa1	MIGRATE: retry one time on I/O error. Now that we cache connections, a retry attempt makes sure that the operation don't fail just because there is an existing connection error on the socket, like the other end closing the connection. Unfortunately this condition is not detectable using getsockopt(SO_ERROR), so the only option left is to retry. We don't retry on timeouts.	2012-11-14 11:30:24 +01:00
antirez	aa2bf6ba8b	TTL API change: TTL returns -2 for non existing keys. The previous behavior was to return -1 if: 1) Existing key but without an expire set. 2) Non existing key. Now the second case is handled in a different, and TTL will return -2 if the key does not exist at all. PTTL follows the same behavior as well.	2012-11-12 23:04:36 +01:00
antirez	05705bc8bb	MIGRATE: fix default timeout to 1000 milliseconds. When a timeout <= 0 is provided we set a default timeout of 1 second. It was set to 1 millisecond for an error resulting from a recent change.	2012-11-12 18:54:35 +01:00
antirez	c8852ebf19	MIGRATE count of cached sockets in INFO output.	2012-11-12 14:01:56 +01:00
antirez	149b527a74	MIGRATE timeout should be in milliseconds. While it is documented that the MIGRATE timeout is in milliseconds, it was in seconds instead. This commit fixes the problem.	2012-11-12 14:01:02 +01:00
antirez	e23d281e48	MIGRATE TCP connections caching. By caching TCP connections used by MIGRATE to chat with other Redis instances a 5x performance improvement was measured with redis-benchmark against small keys. This can dramatically speedup cluster resharding and other processes where an high load of MIGRATE commands are used.	2012-11-12 00:47:24 +01:00
antirez	4365e5b2d3	BSD license added to every C source and header file.	2012-11-08 18:31:32 +01:00
antirez	1237d71c4e	COPY and REPLACE options for MIGRATE. With COPY now MIGRATE does not remove the key from the source instance. With REPLACE it uses RESTORE REPLACE on the target host so that even if the key already eixsts in the target instance it will be overwritten. The options can be used together.	2012-11-07 15:32:27 +01:00
antirez	e5b5763f56	REPLACE option for RESTORE. The REPLACE option deletes an existing key with the same name (if any) and materializes the new one. The default behavior without RESTORE is to return an error if a key already exists.	2012-11-07 10:57:23 +01:00
antirez	c4b0b6854e	Type mismatch errors are now prefixed with WRONGTYPE. So instead to reply with a generic error like: -ERR ... wrong kind of value ... now it replies with: -WRONGTYPE ... wrong kind of value ... This makes this particular error easy to check without resorting to (fragile) pattern matching of the error string (however the error string used to be consistent already). Client libraries should return a specific exeption type for this error. Most of the commit is about fixing unit tests.	2012-11-06 20:25:34 +01:00
Salvatore Sanfilippo	06851a93de	Merge pull request #741 from Run/typo fix a typo in redis.h line 595 comment	2012-11-02 04:10:47 -07:00
antirez	05d8e2c938	More robust handling of AOF rewrite child. After the wait3() syscall we used to do something like that: if (pid == server.rdb_child_pid) { backgroundSaveDoneHandler(exitcode,bysignal); } else { .... } So the AOF rewrite was handled in the else branch without actually checking if the pid really matches. This commit makes the check explicit and logs at WARNING level if the pid returned by wait3() does not match neither the RDB or AOF rewrite child.	2012-11-01 22:39:39 +01:00
Yecheng Fu	f0266532fc	fix typo in comments (redis.c, networking.c)	2012-11-01 22:26:46 +01:00
antirez	2ea41242f6	Unix socket clients properly displayed in MONITOR and CLIENT LIST. This also fixes issue #745.	2012-11-01 22:10:45 +01:00
Runzhen Wang	c23c657cdd	fix a typo in redis.h line 595 comment	2012-11-01 02:14:22 +08:00
antirez	973cb21a01	Invert two sides of if expression in SET to avoid a lookup. Because of the short circuit behavior of && inverting the two sides of the if expression avoids an hash table lookup if the non-EX variant of SET is called. Thanks to Weibin Yao (@yaoweibin on github) for spotting this.	2012-10-31 09:23:05 +01:00
antirez	b16e423430	No longer used macro rdbIsOpcode() removed.	2012-10-30 19:10:46 +01:00
antirez	08a4f12e76	help.h update (adds bitop, bitcount, evalsha...)	2012-10-30 18:57:20 +01:00
antirez	89423052ca	Marginally more robust glibc version test for sync_file_range detection.	2012-10-26 15:55:12 +02:00
charsyam	4800331bf8	patch config.h for sync_file_range	2012-10-26 04:27:58 +08:00
antirez	d2f83d4a76	Fix compilation on Linux kernels or glibc versions lacking sync_file_range(). This fixes issue #667. Many thanks to Didier Spezia for the fix.	2012-10-25 22:01:20 +02:00
antirez	68fc64afd4	Update memory peak stats while loading RDB / AOF.	2012-10-24 12:21:41 +02:00
antirez	89e74abfb6	A filed called slave_read_only added in INFO output. This was an important information missing from the INFO output in the replication section. It obviously reflects if the slave is read only or not.	2012-10-22 19:21:47 +02:00
Salvatore Sanfilippo	ecd82f59fe	Merge pull request #693 from ghurrell/dict-h-typos Fix (cosmetic) typos in dict.h	2012-10-22 02:55:23 -07:00
Schuster	e5f794ff3a	redis-check-dump now understands dumps produced by Redis 2.6 (Commit message from @antirez as it was missign in the original commits, also the patch was modified a bit to still work with 2.4 dumps and to avoid if expressions that are always true due to checked types range) This commit changes redis-check-dump to account for new encodings and for the new MSTIME expire format. It also refactors the test for valid type into a function. The code is still compatible with Redis 2.4 generated dumps. This fixes issue #709.	2012-10-22 11:44:20 +02:00
antirez	c2661ed761	Default memory limit for 32bit instanced moved from 3.5 GB to 3 GB. In some system, notably osx, the 3.5 GB limit was too far and not able to prevent a crash for out of memory. The 3 GB limit works better and it is still a lot of memory within a 4 GB theorical limit so it's not going to bore anyone :-) This fixes issue #711	2012-10-22 10:43:39 +02:00
antirez	acfe3675e3	Differentiate SCRIPT KILL error replies. When calling SCRIPT KILL currently you can get two errors: * No script in timeout (busy) state. * The script already performed a write. It is useful to be able to distinguish the two errors, but right now both start with "ERR" prefix, so string matching (that is fragile) must be used. This commit introduces two different prefixes. -NOTBUSY and -UNKILLABLE respectively to reply with an error when no script is busy at the moment, and when the script already executed a write operation and can not be killed.	2012-10-22 10:31:28 +02:00
antirez	a1b1c1ea3a	Fix MULTI / EXEC rendering in MONITOR output. Before of this commit it used to be like this: MULTI EXEC ... actual commands of the transaction ... Because after all that is the natural order of things. Transaction commands are queued and executed only after EXEC is called. However this makes debugging with MONITOR a mess, so the code was modified to provide a coherent output. What happens is that MULTI is rendered in the MONITOR output as far as possible, instead EXEC is propagated only after the transaction is executed, or even in the case it fails because of WATCH, so in this case you'll simply see: MULTI EXEC An empty transaction.	2012-10-16 17:35:50 +02:00
antirez	be6cbd3a6e	Allow AUTH when Redis is busy because of timedout Lua script. If the server is password protected we need to accept AUTH when there is a server busy (-BUSY) condition, otherwise it will be impossible to send SHUTDOWN NOSAVE or SCRIPT KILL. This fixes issue #708.	2012-10-11 18:34:05 +02:00
NanXiao	9eb3a7bc6b	Update src/redis-benchmark.c The code of current implementation: if (c->pending == 0) clientDone(c); In clientDone function, the c's memory has been freed, then the loop will continue: while(c->pending). The memory of c has been freed now, so c->pending is invalid (c is an invalid pointer now), and this will cause memory dump in some platforams(eg: Solaris). So I think the code should be modified as: if (c->pending == 0) { clientDone(c); break; } and this will not lead to while(c->pending).	2012-10-10 17:08:43 +08:00
antirez	da920e75d4	Hash function switched to murmurhash2. The previously used hash function, djbhash, is not secure against collision attacks even when the seed is randomized as there are simple ways to find seed-independent collisions. The new hash function appears to be safe (or much harder to exploit at least) in this case, and has better distribution. Better distribution does not always means that's better. For instance in a fast benchmark with "DEBUG POPULATE 1000000" I obtained the following results: 1.6 seconds with djbhash 2.0 seconds with murmurhash2 This is due to the fact that djbhash will hash objects that follow the pattern `prefix:<id>` and where the id is numerically near, to near buckets. This improves the locality. However in other access patterns with keys that have no relation murmurhash2 has some (apparently minimal) speed advantage. On the other hand a better distribution should significantly improve the quality of the distribution of elements returned with dictGetRandomKey() that is used in SPOP, SRANDMEMBER, RANDOMKEY, and other commands. Everything considered, and under the suspect that this commit fixes a security issue in Redis, we are switching to the new hash function. If some serious speed regression will be found in the future we'll be able to step back easiliy. This commit fixes issue #663.	2012-10-05 11:20:13 +02:00
antirez	c43aea7e9f	Warn when configured maxmemory value seems odd. This commit warns the user with a log at "warning" level if: 1) After the server startup the maxmemory limit was found to be < 1MB. 2) After a CONFIG SET command modifying the maxmemory setting the limit is set to a value that is smaller than the currently used memory. The behaviour of the Redis server is unmodified, and this wil not make the CONFIG SET command or a wrong configuration in redis.conf less likely to create problems, but at least this will make aware most users about a possbile error they committed without resorting to external help. However no warning is issued if, as a result of loading the AOF or RDB file, we are very near the maxmemory setting, or key eviction will be needed in order to go under the specified maxmemory setting. The reason is that in servers configured as a cache with an aggressive maxmemory-policy most of the times restarting the server will cause this condition to happen if persistence is not switched off. This fixes issue #429.	2012-10-05 11:16:22 +02:00
antirez	2b73b3509a	Include time.h in ae.c as we now use time().	2012-10-05 10:10:43 +02:00
Jokea	b7b2a1cc5e	Force expire all timer events when system clock skew is detected. When system time changes back, the timer will not worker properly hence some core functionality of redis will stop working(e.g. replication, bgsave, etc). See issue #633 for details. The patch saves the previous time and when a system clock skew is detected, it will force expire all timers. Modiifed by @antirez: the previous time was moved into the eventLoop structure to make sure the library is still thread safe as long as you use different event loops into different threads (otherwise you need some synchronization). More comments added about the reasoning at the base of the patch, that's worth reporting here: /* If the system clock is moved to the future, and then set back to the * right value, time events may be delayed in a random way. Often this * means that scheduled operations will not be performed soon enough. * * Here we try to detect system clock skews, and force all the time * events to be processed ASAP when this happens: the idea is that * processing events earlier is less dangerous than delaying them * indefinitely, and practice suggests it is. */	2012-10-04 19:30:42 +02:00
antirez	f0b9f80345	"Timeout receiving bulk data" error message modified. The new message now contains an hint about modifying the repl-timeout configuration directive if the problem persists. This should normally not be needed, because while the master generates the RDB file it makes sure to send newlines to the replication channel to prevent timeouts. However there are times when masters running on very slow systems can completely stop for seconds during the RDB saving process. In such a case enlarging the timeout value can fix the problem. See issue #695 for an example of this problem in an EC2 deployment.	2012-10-04 11:52:16 +02:00
antirez	9a914a632d	"SORT by nosort" (skip sorting) respect sorted set ordering. When SORT is called with the option BY set to a string constant not inclduing the wildcard character "*", there is no way to sort the output so any ordering is valid. This allows the SORT internals to optimize its work and don't really sort the output at all. However it was odd that this option was not able to retain the natural order of a sorted set. This feature was requested by users multiple times as sometimes to call SORT with GET against sorted sets as a way to mass-fetch objects can be handy. This commit introduces two things: 1) The ability of SORT to return sorted sets elements in their natural ordering when `BY nosort` is specified, accordingly to `DESC / ASC` options. 2) The ability of SORT to optimize this case further if LIMIT is passed as well, avoiding to really fetch the whole sorted set, but directly obtaining the specified range. Because in this case the sorting is always deterministic, no post-sorting activity is performed when SORT is called from a Lua script. This commit fixes issue #98.	2012-10-03 14:54:43 +02:00
Greg Hurrell	4b1f6ad3e7	Fix (cosmetic) typos in dict.h	2012-10-02 22:01:26 -07:00
antirez	ece77037e9	Revert "Scripting: redis.NIL to return nil bulk replies." This reverts commit `e061d797d7`. Conflicts: src/scripting.c	2012-10-01 10:10:31 +02:00
antirez	9c21b72bb9	Scripting: add helper functions redis.error_reply() and redis.status_reply(). A previous commit introduced Redis.NIL. This commit adds similar helper functions to return tables with a single field set to the specified string so that instead of using 'return {err="My Error"}' it is possible to use a more idiomatic form: return redis.error_reply("My Error") return redis.status_reply("OK")	2012-09-28 16:54:57 +02:00
antirez	6dd1693c0e	Scripting: redis.NIL to return nil bulk replies. Lua arrays can't contain nil elements (see http://www.lua.org/pil/19.1.html for more information), so Lua scripts were not able to return a multi-bulk reply containing nil bulk elements inside. This commit introduces a special conversion: a table with just a "nilbulk" field set to a boolean value is converted by Redis as a nil bulk reply, but at the same time for Lua this type is not a "nil" so can be used inside Lua arrays. This type is also assigned to redis.NIL, so the following two forms are equivalent and will be able to return a nil bulk reply as second element of a three elements array: EVAL "return {1,redis.NIL,3}" 0 EVAL "return {1,{nilbulk=true},3}" 0 The result in redis-cli will be: 1) (integer) 1 2) (nil) 3) (integer) 3	2012-09-28 14:26:20 +02:00
antirez	db100c4671	Sentinel: Support for AUTH.	2012-09-26 18:59:54 +02:00
antirez	578c94597f	SRANDMEMBER <count> leak fixed. For "CASE 4" (see code) we need to free the element if it's already in the result dictionary and adding it failed.	2012-09-21 11:55:32 +02:00
antirez	be90c803e3	Added the SRANDMEMBER key <count> variant. SRANDMEMBER called with just the key argument can just return a single random element from a Redis Set. However many users need to return multiple unique elements from a Set, this is not a trivial problem to handle in the client side, and for truly good performance a C implementation was required. After many requests for this feature it was finally implemented. The problem implementing this command is the strategy to follow when the number of elements the user asks for is near to the number of elements that are already inside the set. In this case asking random elements to the dictionary API, and trying to add it to a temporary set, may result into an extremely poor performance, as most add operations will be wasted on duplicated elements. For this reason this implementation uses a different strategy in this case: the Set is copied, and random elements are returned to reach the specified count. The code actually uses 4 different algorithms optimized for the different cases. If the count is negative, the command changes behavior and allows for duplicated elements in the returned subset.	2012-09-21 11:55:28 +02:00
antirez	d310fbedab	Fix compilation on FreeBSD. Thanks to @koobs on twitter.	2012-09-17 12:46:06 +02:00
antirez	7eb850ef0e	A reimplementation of blocking operation internals. Redis provides support for blocking operations such as BLPOP or BRPOP. This operations are identical to normal LPOP and RPOP operations as long as there are elements in the target list, but if the list is empty they block waiting for new data to arrive to the list. All the clients blocked waiting for th same list are served in a FIFO way, so the first that blocked is the first to be served when there is more data pushed by another client into the list. The previous implementation of blocking operations was conceived to serve clients in the context of push operations. For for instance: 1) There is a client "A" blocked on list "foo". 2) The client "B" performs `LPUSH foo somevalue`. 3) The client "A" is served in the context of the "B" LPUSH, synchronously. Processing things in a synchronous way was useful as if "A" pushes a value that is served by "B", from the point of view of the database is a NOP (no operation) thing, that is, nothing is replicated, nothing is written in the AOF file, and so forth. However later we implemented two things: 1) Variadic LPUSH that could add multiple values to a list in the context of a single call. 2) BRPOPLPUSH that was a version of BRPOP that also provided a "PUSH" side effect when receiving data. This forced us to make the synchronous implementation more complex. If client "B" is waiting for data, and "A" pushes three elemnents in a single call, we needed to propagate an LPUSH with a missing argument in the AOF and replication link. We also needed to make sure to replicate the LPUSH side of BRPOPLPUSH, but only if in turn did not happened to serve another blocking client into another list ;) This were complex but with a few of mutually recursive functions everything worked as expected... until one day we introduced scripting in Redis. Scripting + synchronous blocking operations = Issue #614. Basically you can't "rewrite" a script to have just a partial effect on the replicas and AOF file if the script happened to serve a few blocked clients. The solution to all this problems, implemented by this commit, is to change the way we serve blocked clients. Instead of serving the blocked clients synchronously, in the context of the command performing the PUSH operation, it is now an asynchronous and iterative process: 1) If a key that has clients blocked waiting for data is the subject of a list push operation, We simply mark keys as "ready" and put it into a queue. 2) Every command pushing stuff on lists, as a variadic LPUSH, a script, or whatever it is, is replicated verbatim without any rewriting. 3) Every time a Redis command, a MULTI/EXEC block, or a script, completed its execution, we run the list of keys ready to serve blocked clients (as more data arrived), and process this list serving the blocked clients. 4) As a result of "3" maybe more keys are ready again for other clients (as a result of BRPOPLPUSH we may have push operations), so we iterate back to step "3" if it's needed. The new code has a much simpler semantics, and a simpler to understand implementation, with the disadvantage of not being able to "optmize out" a PUSH+BPOP as a No OP. This commit will be tested with care before the final merge, more tests will be added likely.	2012-09-17 10:26:46 +02:00
antirez	bfc197c3b6	Make sure that SELECT argument is an integer or return an error. Unfortunately we had still the lame atoi() without any error checking in place, so "SELECT foo" would work as "SELECT 0". This was not an huge problem per se but some people expected that DB can be strings and not just numbers, and without errors you get the feeling that they can be numbers, but not the behavior. Now getLongFromObjectOrReply() is used as almost everybody else across the code, generating an error if the number is not an integer or overflows the long type. Thanks to @mipearson for reporting that on Twitter.	2012-09-11 10:32:04 +02:00
antirez	978e5177fd	Match printf format with actual type in genRedisInfoString().	2012-09-10 12:42:55 +02:00
Haruto Otake	749aac72ad	BITCOUNT: fix segmentation fault. remove unsafe and unnecessary cast. until now, this cast may lead segmentation fault when end > UINT_MAX setbit foo 0 1 bitcount 0 4294967295 => ok bitcount 0 4294967296 => cause segmentation fault. Note by @antirez: the commit was modified a bit to also change the string length type to long, since it's guaranteed to be at max 512 MB in size, so we can work with the same type across all the code path. A regression test was also added.	2012-09-05 16:19:04 +02:00
Salvatore Sanfilippo	24bc807b5c	Merge pull request #576 from saj/fix-slave-ping-period Bug fix: slaves being pinged every second	2012-09-05 06:59:37 -07:00
antirez	36741b2c81	Scripting: Force SORT BY constant determinism inside SORT itself. SORT is able to return (faster than when ordering) unordered output if the "BY" clause is used with a constant value. However we try to play well with scripting requirements of determinism providing always sorted outputs when SORT (and other similar commands) are called by Lua scripts. However we used the general mechanism in place in scripting in order to reorder SORT output, that is, if the command has the "S" flag set, the Lua scripting engine will take an additional step when converting a multi bulk reply to Lua value, calling a Lua sorting function. This is suboptimal as we can do it faster inside SORT itself. This is also broken as issue #545 shows us: basically when SORT is used with a constant BY, and additionally also GET is used, the Lua scripting engine was trying to order the output as a flat array, while it was actually a list of key-value pairs. What we do know is to recognized if the caller of SORT is the Lua client (since we can check this using the REDIS_LUA_CLIENT flag). If so, and if a "don't sort" condition is triggered by the BY option with a constant string, we force the lexicographical sorting. This commit fixes this bug and improves the performance, and at the same time simplifies the implementation. This does not mean I'm smart today, it means I was stupid when I committed the original implementation ;)	2012-09-05 01:17:49 +02:00
antirez	9bd0e097aa	Sentinel: reply -IDONTKNOW to get-master-addr-by-name on lack of info. If we don't have any clue about a master since it never replied to INFO so far, reply with an -IDONTKNOW error to SENTINEL get-master-addr-by-name requests.	2012-09-04 16:06:53 +02:00
antirez	8bdde086ac	Sentinel: more easy master redirection if master is a slave. Before this commit Sentienl used to redirect master ip/addr if the current instance reported to be a slave only if this was the first INFO output received, and the role was found to be slave. Now instead also if we find that the runid is different, and the reported role is slave, we also redirect to the reported master ip/addr. This unifies the behavior of Sentinel in the case of a reboot (where it will see the first INFO output with the wrong role and will perform the redirection), with the behavior of Sentinel in the case of a change in what it sees in the INFO output of the master.	2012-09-04 15:52:04 +02:00
antirez	bb66fc3120	Send an async PING before starting replication with master. During the first synchronization step of the replication process, a Redis slave connects with the master in a non blocking way. However once the connection is established the replication continues sending the REPLCONF command, and sometimes the AUTH command if needed. Those commands are send in a partially blocking way (blocking with timeout in the order of seconds). Because it is common for a blocked master to accept connections even if it is actually not able to reply to the slave requests, it was easy for a slave to block if the master had serious issues, but was still able to accept connections in the listening socket. For this reason we now send an asynchronous PING request just after the non blocking connection ended in a successful way, and wait for the reply before to continue with the replication process. It is very unlikely that a master replying to PING can't reply to the other commands. This solution was proposed by Didier Spezia (Thanks!) so that we don't need to turn all the replication process into a non blocking affair, but still the probability of a slave blocked is minimal even in the event of a failing master. Also we now use getsockopt(SO_ERROR) in order to check errors ASAP in the event handler, instead of waiting for actual I/O to return an error. This commit fixes issue #632.	2012-09-02 12:24:38 +02:00
antirez	e323635c2d	Scripting: Reset Lua fake client reply_bytes after command execution. Lua scripting uses a fake client in order to run commands in the context of a client, accumulate the reply, and convert it into a Lua object to return to the caller. This client is reused again and again, and is referenced by the server.lua_client globally accessible pointer. However after every call to redis.call() or redis.pcall(), that is handled by the luaRedisGenericCommand() function, the reply_bytes field of the client was not set back to zero. This filed is used to estimate the amount of memory currently used in the reply. Because of the lack of reset, script after script executed, this value used to get bigger and bigger, and in the end on 32 bit systems it triggered the following assert: redisAssert(c->reply_bytes < ULONG_MAX-(1024*64)); On 64 bit systems this does not happen because it takes too much time to reach values near to 2^64 for users to see the practical effect of the bug. Now in the cleanup stage of luaRedisGenericCommand() we reset the reply_bytes counter to zero, avoiding the issue. It is not practical to add a test for this bug, but the fix was manually tested using a debugger. This commit fixes issue #656.	2012-08-31 11:15:02 +02:00
antirez	46c31a150a	Scripting: require at least one argument for redis.call(). Redis used to crash with a call like the following: EVAL "redis.call()" 0 Now the explicit check for at least one argument prevents the problem. This commit fixes issue #655.	2012-08-31 10:28:13 +02:00
antirez	6276434ad2	Sentinel: do not crash against slaves not publishing the runid. Older versions of Redis (before 2.4.17) don't publish the runid field in INFO. This commit makes Sentinel able to handle that without crashing.	2012-08-30 18:01:52 +02:00
antirez	58186b9dcf	Sentinel: INFO command implementation.	2012-08-29 12:44:24 +02:00
antirez	8246e58abe	Sentinel: add Redis execution mode to INFO output. The new "redis_mode" field in the INFO output will show if Redis is running in standalone mode, cluster, or sentinel mode.	2012-08-29 11:44:01 +02:00
antirez	3ec701e059	Sentinel: Sentinel-side support for slave priority. The slave priority that is now published by Redis in INFO output is now used by Sentinel in order to select the slave with minimum priority for promotion, and in order to consider slaves with priority set to 0 as not able to play the role of master (they will never be promoted by Sentinel). The "slave-priority" field is now one of the fileds that Sentinel publishes when describing an instance via the SENTINEL commands such as "SENTINEL slaves mastername".	2012-08-28 17:45:01 +02:00
antirez	169a44cbd6	Sentinel: Redis-side support for slave priority. A Redis slave can now be configured with a priority, that is an integer number that is shown in INFO output and can be get and set using the redis.conf file or the CONFIG GET/SET command. This field is used by Sentinel during slave election. A slave with lower priority is preferred. A slave with priority zero is never elected (and is considered to be impossible to elect even if it is the only slave available). A next commit will add support in the Sentinel side as well.	2012-08-28 17:20:26 +02:00
antirez	c14e0ecafd	Sentinel: suppress harmless warning by initializing 'table' to NULL. Note that the assertion guarantees that one of the if branches setting table is always entered.	2012-08-28 12:56:05 +02:00
antirez	784b93087c	Incrementally flush RDB on disk while loading it from a master. This fixes issue #539. Basically if there is enough free memory the OS may buffer the RDB file that the slave transfers on disk from the master. The file may actually be flused on disk at once by the operating system when it gets closed by Redis, causing the close system call to block for a long time. This patch is a modified version of one provided by yoav-steinberg of @garantiadata (the original version was posted in the issue #539 comments), and tries to flush the OS buffers incrementally (every 8 MB of loaded data).	2012-08-28 12:47:33 +02:00
antirez	1caa627e4e	Fix a forget zmalloc_oom() -> zmalloc_oom_handler() replacement.	2012-08-24 15:40:22 +02:00
antirez	6fdc635447	Better Out of Memory handling. The previous implementation of zmalloc.c was not able to handle out of memory in an application-specific way. It just logged an error on standard error, and aborted. The result was that in the case of an actual out of memory in Redis where malloc returned NULL (In Linux this actually happens under specific overcommit policy settings and/or with no or little swap configured) the error was not properly logged in the Redis log. This commit fixes this problem, fixing issue #509. Now the out of memory is properly reported in the Redis log and a stack trace is generated. The approach used is to provide a configurable out of memory handler to zmalloc (otherwise the default one logging the event on the standard output is used).	2012-08-24 12:55:37 +02:00
antirez	850789ce73	Sentinel: send SCRIPT KILL on -BUSY reply and SDOWN instance. From the point of view of Redis an instance replying -BUSY is down, since it is effectively not able to reply to user requests. However a looping script is a recoverable condition in Redis if the script still did not performed any write to the dataset. In that case performing a fail over is not optimal, so Sentinel now tries to restore the normal server condition killing the script with a SCRIPT KILL command. If the script already performed some write before entering an infinite (or long enough to timeout) loop, SCRIPT KILL will not work and the fail over will be triggered anyway.	2012-08-24 12:29:54 +02:00
antirez	01477753e6	Sentinel: fixed a crash on script execution. The call to sentinelScheduleScriptExecution() lacked the final NULL argument to signal the end of arguments. This resulted into a crash.	2012-08-24 12:10:24 +02:00
Salvatore Sanfilippo	724371d748	Merge pull request #628 from pietern/unstable-zip Fix ziplist edge case	2012-08-22 02:32:27 -07:00
antirez	227b429364	redis-benchmark: disable big buffer cleanup in hiredis context. This new hiredis features allows us to reuse a previous context reader buffer even if already very big in order to maximize performances with big payloads (Usually hiredis re-creates buffers when they are too big and unused in order to save memory).	2012-08-21 17:31:44 +02:00
Pieter Noordhuis	3cc2904e3e	Set p to its new offset before modifying it	2012-08-13 14:13:09 -07:00
Pieter Noordhuis	5117c20ab5	Add ziplist test for deleting next to last entries	2012-08-13 14:09:40 -07:00
antirez	cada7f9671	Sentinel: SENTINEL FAILOVER command implemented. This command can be used in order to force a Sentinel instance to start a failover for the specified master, as leader, forcing the failover even if the master is up. The commit also adds some minor refactoring and other improvements to functions already implemented that make them able to work when the master is not in SDOWN condition. For instance slave selection assumed that we ask INFO every second to every slave, this is true only when the master is in SDOWN condition, so slave selection did not worked when the master was not in SDOWN condition.	2012-08-03 12:41:27 +02:00
antirez	6275004ca6	Sentinel: client reconfiguration script execution. This commit adds support to optionally execute a script when one of the following events happen: * The failover starts (with a slave already promoted). * The failover ends. * The failover is aborted. The script is called with enough parameters (documented in the example sentinel.conf file) to provide information about the old and new ip:port pair of the master, the role of the sentinel (leader or observer) and the name of the master. The goal of the script is to inform clients of the configuration change in a way specific to the environment Sentinel is running, that can't be implemented in a genereal way inside Sentinel itself.	2012-08-02 18:40:30 +02:00
antirez	fd92b366b0	Sentinel: when leader in wait-start, sense another leader as race. When we are in wait start, if another leader (or any other external entity) turns a slave into a master, abort the failover, and detect it as an observer. Note that the wait-start state is mainly there for this reason but the abort was yet not implemented. This adds a new sentinel event -failover-abort-race.	2012-07-31 17:11:26 +02:00
antirez	91c15ed1b5	Sentinel: sentinelRefreshInstanceInfo() comments improved a bit.	2012-07-31 16:18:15 +02:00
Michael Parker	f1d187bb3e	Use correct variable name for value to convert. Note by @antirez: this code was never compiled because utils.c lacked the float.h include, so we never noticed this variable was mispelled in the past. This should provide a noticeable speed boost when saving certain types of databases with many sorted sets inside.	2012-07-31 11:48:00 +02:00
antirez	75084e057d	Sentinel: abort failover when in wait-start if master is back. When we are a Leader Sentinel in wait-start state, starting with this commit the failover is aborted if the master returns online. This improves the way we handle a notable case of net split, that is the split between Sentinels and Redis servers, that will be a very common case of split becase Sentinels will often be installed in the client's network and servers can be in a differnt arm of the network. When Sentinels and Redis servers are isolated the master is in ODOWN condition since the Sentinels can agree about this state, however the failover does not start since there are no good slaves to promote (in this specific case all the slaves are unreachable). However when the split is resolved, Sentinels may sense the slave back a moment before they sense the master is back, so the failover may start without a good reason (since the master is actually working too). Now this condition is reversible, so the failover will be aborted immediately after if the master is detected to be working again, that is, not in SDOWN nor in ODOWN condition.	2012-07-31 10:19:34 +02:00
antirez	7f5bdba434	Merge remote-tracking branch 'origin/unstable' into unstable	2012-07-28 20:55:17 +02:00
antirez	3f194a9d25	Sentinel: scripts execution engine improved. We no longer use a vanilla fork+execve but take a queue of jobs of scripts to execute, with retry on error, timeouts, and so forth. Currently this is used only for notifications but soon the ability to also call clients reconfiguration scripts will be added.	2012-07-28 20:54:27 +02:00
Jan-Erik Rediger	c6c19c8372	Include sys/wait.h to avoid compiler warning gcc warned about an implicit declaration of function 'wait3'. Including this header fixes this.	2012-07-28 12:33:01 +03:00
Salvatore Sanfilippo	ed7b308c1c	Merge pull request #587 from saj/truncate-short-write-from-aof Truncate short write from the AOF	2012-07-27 03:56:48 -07:00
Salvatore Sanfilippo	04950a9e4d	Merge pull request #586 from saj/aof_last_bgrewrite_status New in INFO: aof_last_bgrewrite_status	2012-07-27 03:55:20 -07:00
antirez	ce7b838fb9	Sentinel: don't start a failover as leader if there is no good slave.	2012-07-26 12:09:40 +02:00
antirez	baace5fc42	Sentinel: ability to execute notification scripts.	2012-07-25 16:33:37 +02:00
Salvatore Sanfilippo	42c571864e	Merge pull request #603 from mrb/fix_sentinel_config_warning Fix warning in redis.c for sentinel config load	2012-07-25 07:15:53 -07:00
mrb	f1c8661e74	Fix warning in redis.c for sentinel config load	2012-07-25 09:55:53 -04:00
antirez	672102c2ce	Sentinel: abort failover if no good slave is available. The previous behavior of the state machine was to wait some time and retry the slave selection, but this is not robust enough against drastic changes in the conditions of the monitored instances. What we do now when the slave selection fails is to abort the failover and return back monitoring the master. If the ODOWN condition is still present a new failover will be triggered and so forth. This commit also refactors the code we use to abort a failover.	2012-07-25 11:32:19 +02:00
antirez	9e5bef38e6	Sentinel: reset pending_commands in a more generic way.	2012-07-24 18:57:26 +02:00
antirez	a23a5b6c7d	Prevent a spurious +sdown event on switch. When we reset the master we should start with clean timestamps for ping replies otherwise we'll detect a spurious +sdown event, because on +master-switch event the previous master instance was probably in +sdown condition. Since we updated the address we should count time from scratch again. Also this commit makes sure to explicitly reset the count of pending commands, now we can do this because of the new way the hiredis link is closed.	2012-07-24 18:46:04 +02:00
antirez	d918e6f127	Sentinel: debugging message removed.	2012-07-24 18:20:05 +02:00
antirez	75fb6e5b8a	Sentinel: changes to connection handling and redirection. We disconnect the Redis instances hiredis link in a more robust way now. Also we change the way we perform the redirection for the +switch-master event, that is not just an instance reset with an address change. Using the same system we now implement the +redirect-to-master event that is triggered by an instance that is configured to be master but found to be a slave at the first INFO reply. In that case we monitor the master instead, logging the incident as an event.	2012-07-24 18:15:44 +02:00
antirez	2179c26916	Sentinel: check that instance still exists in reply callbacks. We can't be sure the instance object still exists when the reply callback is called.	2012-07-24 16:37:57 +02:00
antirez	d876d6feac	Sentinel: more robust failover detection as observer. Sentinel observers detect failover checking if a slave attached to the monitored master turns into its replication state from slave to master. However while this change may in theory only happen after a SLAVEOF NO ONE command, in practie it is very easy to reboot a slave instance with a wrong configuration that turns it into a master, especially if it was a past master before a successfull failover. This commit changes the detection policy so that if an instance goes from slave to master, but at the same time the runid has changed, we sense a reboot, and in that case we don't detect a failover at all. This commit also introduces the "reboot" sentinel event, that is logged at "warning" level (so this will trigger an admin notification). The commit also fixes a problem in the disconnect handler that assumed that the instance object always existed, that is not the case. Now we no longer assume that redisAsyncFree() will call the disconnection handler before returning.	2012-07-24 12:42:40 +02:00
antirez	6b5daa2df2	First implementation of Redis Sentinel. This commit implements the first, beta quality implementation of Redis Sentinel, a distributed monitoring system for Redis with notification and automatic failover capabilities. More info at http://redis.io/topics/sentinel	2012-07-23 13:14:44 +02:00
antirez	03f412ddef	Merge remote-tracking branch 'origin/unstable' into unstable	2012-07-22 17:18:42 +02:00
antirez	5d73073f6e	Allow Pub/Sub in contexts where other commands are blocked. Redis loading data from disk, and a Redis slave disconnected from its master with serve-stale-data disabled, are two conditions where commands are normally refused by Redis, returning an error. However there is no reason to disable Pub/Sub commands as well, given that this layer does not interact with the dataset. To allow Pub/Sub in as many contexts as possible is especially interesting now that Redis Sentinel uses Pub/Sub of a Redis master as a communication channel between Sentinels. This commit allows Pub/Sub to be used in the above two contexts where it was previously denied.	2012-07-22 17:18:16 +02:00
antirez	b62bdf1c64	Don't assume that "char" is signed. For the C standard char can be either signed or unsigned, it's up to the compiler, but Redis assumed that it was signed in a few places. The practical effect of this patch is that now Redis 2.6 will run correctly in every system where char is unsigned, notably the RaspBerry PI and other ARM systems with GCC. Thanks to Georgi Marinov (@eesn on twitter) that reported the problem and allowed me to use his RaspBerry via SSH to trace and fix the issue!	2012-07-18 12:04:58 +02:00
Saj Goonatilleke	55302e9e28	Truncate short write from the AOF If Redis only manages to write out a partial buffer, the AOF file won't load back into Redis the next time it starts up. It is better to discard the short write than waste time running redis-check-aof.	2012-07-18 10:35:17 +10:00
Saj Goonatilleke	48553a29e8	New in INFO: aof_last_bgrewrite_status Behaves like rdb_last_bgsave_status -- even down to reporting 'ok' when no rewrite has been done yet. (You might want to check that aof_last_rewrite_time_sec is not -1.)	2012-07-18 09:54:55 +10:00
Steeve Lennmark	e9828cb6f7	Check that we have connection before enabling pipe mode	2012-07-15 14:35:02 +02:00
Saj Goonatilleke	9edfe63553	Bug fix: slaves being pinged every second REDIS_REPL_PING_SLAVE_PERIOD controls how often the master should transmit a heartbeat (PING) to its slaves. This period, which defaults to 10, is measured in seconds. Redis 2.4 masters used to ping their slaves every ten seconds, just like it says on the tin. The Redis 2.6 masters I have been experimenting with, on the other hand, ping their slaves every second. (master_last_io_seconds_ago never approaches 10.) I think the ping period was inadvertently slashed to one-tenth of its nominal value around the time REDIS_HZ was introduced. This commit reintroduces correct ping schedule behaviour.	2012-07-05 14:29:27 +10:00
jokea	93b0075d33	mark fd as writable when EPOLLERR or EPOLLHUP is returned by epoll_wait.	2012-06-29 12:06:38 +08:00
antirez	36def8fd9a	Typo in comment.	2012-06-27 11:26:44 +02:00
antirez	3a32897856	REPLCONF internal command introduced. The REPLCONF command is an internal command (not designed to be directly used by normal clients) that allows a slave to set some replication related state in the master before issuing SYNC to start the replication. The initial motivation for this command, and the only reason currently it is used by the implementation, is to let the slave instance communicate its listening port to the slave, so that the master can show all the slaves with their listening ports in the "replication" section of the INFO output. This allows clients to auto discover and query all the slaves attached into a master. Currently only a single option of the REPLCONF command is supported, and it is called "listening-port", so the slave now starts the replication process with something like the following chat: REPLCONF listening-prot 6380 SYNC Note that this works even if the master is an older version of Redis and does not understand REPLCONF, because the slave ignores the REPLCONF error. In the future REPLCONF can be used for partial replication and other replication related features where there is the need to exchange information between master and slave. NOTE: This commit also fixes a bug: the INFO outout already carried information about slaves, but the port was broken, and was obtained with getpeername(2), so it was actually just the ephemeral port used by the slave to connect to the master as a client.	2012-06-27 09:43:57 +02:00
antirez	5410168c6e	Fixed comment typo into time_independent_strcmp().	2012-06-21 14:25:53 +02:00
antirez	31a1439bfd	Fixed a timing attack on AUTH (Issue #560 ). The way we compared the authentication password using strcmp() allowed an attacker to gain information about the password using a well known class of attacks called "timing attacks". The bug appears to be practically not exploitable in most modern systems running Redis since even using multiple bytes of differences in the input at a time instead of one the difference in running time in in the order of 10 nanoseconds, making it hard to exploit even on LAN. However attacks always get better so we are providing a fix ASAP. The new implementation uses two fixed length buffers and a constant time comparison function, with the goal of: 1) Completely avoid leaking information about the content of the password, since the comparison is always performed between 512 characters and without conditionals. 2) Partially avoid leaking information about the length of the password. About "2" we still have a stage in the code where the real password and the user provided password are copied in the static buffers, we also run two strlen() operations against the two inputs, so the running time of the comparison is a fixed amount plus a time proportional to LENGTH(A)+LENGTH(B). This means that the absolute time of the operation performed is still related to the length of the password in some way, but there is no way to change the input in order to get a difference in the execution time in the comparison that is not just proportional to the string provided by the user (because the password length is fixed). Thus in practical terms the user should try to discover LENGTH(PASSWORD) looking at the whole execution time of the AUTH command and trying to guess a proportionality between the whole execution time and the password length: this appears to be mostly unfeasible in the real world. Also protecting from this attack is not very useful in the case of Redis as a brute force attack is anyway feasible if the password is too short, while with a long password makes it not an issue that the attacker knows the length.	2012-06-21 11:50:01 +02:00
antirez	5b63ccce6c	Fix c->reply_bytes computation in setDeferredMultiBulkLength() In order to implement reply buffer limits introduced in 2.6 and useful to close the connection under user-selected circumastances of big output buffers (for instance slow consumers in pub/sub, a blocked slave, and so forth) Redis takes a counter with the amount of used memory in objects inside the output list stored into c->reply. The computation was broken in the function setDeferredMultiBulkLength(), in the case the object was glued with the next one. This caused the c->reply_bytes field to go out of sync, be subtracted more than needed, and wrap back near to ULONG_MAX values. This commit fixes this bug and adds an assertion that is able to trap this class of problems. This problem was discovered looking at the INFO output of an unrelated issue (issue #547).	2012-06-15 10:03:25 +02:00
antirez	ba779119b8	ziplistFind(): don't assume that entries are comparable by encoding. Because Redis 2.6 introduced new integer encodings it is no longer true that if two entries have a different encoding they are not equal. An old ziplist can be loaded from an RDB file generated with Redis 2.4, in this case for instance a small unsigned integers is encoded with a 16 bit encoding, while in Redis 2.6 a more specific 8 bit encoding format is used. Because of this bug hashes ended with duplicated values or fields lookup failed, causing many bad behaviors. This in turn caused a crash while converting the ziplist encoded hash into a real hash table because an assertion was raised on duplicated elements. This commit fixes issue #547. Many thanks to Pinterest's Marty Weiner and colleagues for discovering the problem and helping us in the debugging process.	2012-06-14 16:01:27 +02:00
Ted Nyman	d665dd0865	Standardize punctuation in redis-cli help. Right there is a mix of help entries ending with periods or without periods. This standardizes the end of command as without periods, which seems to be the general custom in most unix tools, at least.	2012-06-12 22:35:00 -07:00
antirez	ee789e157c	Dump ziplist hex value on failed assertion. The ziplist -> hashtable conversion code is triggered every time an hash value must be promoted to a full hash table because the number or size of elements reached the threshold. If a problem in the ziplist causes the same field to be present multiple times, the assertion of successful addition of the element inside the hash table will fail, crashing server with a failed assertion, but providing little information about the problem. This code adds a new logging function to perform the hex dump of binary data, and makes sure that the ziplist -> hashtable conversion code uses this new logging facility to dump the content of the ziplist when the assertion fails. This change was originally made in order to investigate issue #547.	2012-06-12 00:41:48 +02:00
Alex Mitrofanov	51857c7e5c	Fixed RESTORE hash failure (Issue #532 ) (additional commit notes by antirez@gmail.com): The rdbIsObjectType() macro was not updated when the new RDB object type of ziplist encoded hashes was added. As a result RESTORE, that uses rdbLoadObjectType(), failed when a ziplist encoded hash was loaded. This does not affected normal RDB loading because in that case we use the lower-level function rdbLoadType(). The commit also adds a regression test.	2012-06-02 10:24:27 +02:00
antirez	c7a25200e2	RDB type loading functions clarified in comments. Improved comments to make clear that rdbLoadType() just loads a general TYPE in the context of RDB that can be an object type or an expire type, end-of-file, and so forth. While rdbLoadObjectType() enforces that the type is a valid Object Type otherwise it returns -1.	2012-06-02 10:21:57 +02:00

... 8 9 10 11 12 ...

2259 Commits