redis

Commit Graph

Author	SHA1	Message	Date
antirez	6f4fd55762	Sentinel: fixes inverted strcmp() test preventing config updates. The result of this one-char bug was pretty serious, if the new master had the same port of the previous master, but just a different IP address, non-leader Sentinels would not be able to recognize the configuration change. This commit fixes issue #1394. Many thanks to @shanemadden that reported the bug and helped investigating it.	2013-11-25 10:59:53 +01:00
antirez	8d547ebd56	Sentinel: fix type specifier for Hello msg generation. This fixes issue #1395.	2013-11-25 10:24:34 +01:00
antirez	cc6053681f	Sentinel: different comments updated to new implementation.	2013-11-21 16:22:59 +01:00
antirez	685e79998c	Sentinel: cleanup around SENTINEL_INFO_VALIDITY_TIME.	2013-11-21 16:05:41 +01:00
antirez	489d889726	Sentinel: removed mem leak and useless code.	2013-11-21 15:43:55 +01:00
antirez	1ca56ea796	Merge branch 'newsentinel' into unstable	2013-11-21 15:01:25 +01:00
antirez	f55ad3038f	Sentinel: manual failover works again.	2013-11-21 12:39:47 +01:00
antirez	297de1ab26	Sentinel: test for writable config file. This commit introduces a funciton called when Sentinel is ready for normal operations to avoid putting Sentinel specific stuff in redis.c.	2013-11-21 12:28:15 +01:00
antirez	d920177f8d	Sentinel: check for disconnected links in sentinelSendHello(). Does not fix any bug as the test is performed by the caller, but better to have the check.	2013-11-21 11:35:50 +01:00
antirez	8810167d13	Sentinel: Hello message sending code refactored.	2013-11-21 11:31:06 +01:00
antirez	0101c2bcfe	Sentinel: select slave with best (greater) replication offset.	2013-11-20 16:05:36 +01:00
antirez	a6ebd910d8	Sentinel: take the replication offset in slaves state.	2013-11-20 15:53:21 +01:00
antirez	b1f5a0b3ec	CONFIG REWRITE: don't add the signature if it already exists. At the end of the file, CONFIG REWRITE adds a comment line that: # Generated by CONFIG REWRITE Followed by the additional config options required. However this was added again and again at every rewrite in praticular conditions (when a given set of options change in a given time during the time). Now if it was alrady encountered, it is not added a second time. This is especially important for Sentinel that rewrites the config at every state change.	2013-11-19 17:58:11 +01:00
antirez	37a51a2568	Sentinel: distinguish between is-master-down-by-addr requests. Some are just to know if the master is down, and in this case the runid in the request is set to "*", others are actually in order to seek for a vote and get elected. In the latter case the runid is set to the runid of the instance seeking for the vote.	2013-11-19 16:50:04 +01:00
antirez	b22d1beea0	Sentinel: various fixes to leader election implementation.	2013-11-19 16:20:42 +01:00
antirez	1f9728cb20	Sentinel: failover script execution fixed.	2013-11-19 12:34:46 +01:00
antirez	90635488ce	Sentinel: no longer used defines removed.	2013-11-19 11:24:36 +01:00
antirez	0a35f65301	Sentinel: when writing config on disk, remember sentinels runid.	2013-11-19 11:11:43 +01:00
antirez	5450833d02	Sentinel: arity of known-sentinel/slave is 4 not 3.	2013-11-19 11:03:47 +01:00
antirez	b8a94463b7	Sentinel: rewriteConfigSentinelOption() sub-iterators var typo fixed.	2013-11-19 10:59:50 +01:00
antirez	16237d78c8	Sentinel: call sentinelFlushConfig() to persist state when needed. Also the sentinel configuration rewriting was modified in order to account for failover in progress, where we need to provide the promoted slave address as master address, and the old master address as one of the slaves address.	2013-11-19 10:55:43 +01:00
antirez	e257ab2bfe	Sentinel: sentinelFlushConfig() to CONFIG REWRITE + fsync.	2013-11-19 10:13:04 +01:00
antirez	5998769c28	Sentinel: CONFIG REWRITE support for Sentinel config.	2013-11-19 09:48:12 +01:00
antirez	47df12d5d9	Sentinel: can-failover option removed, many comments fixed.	2013-11-19 09:28:47 +01:00
antirez	cd4ff9992b	Fix typo 'configuraiton' in rewriteConfigRewriteLine() comment.	2013-11-18 18:18:10 +01:00
antirez	232cdb95ab	Sentinel: added config options useful to take state on config rewrite. We'll use CONFIG REWRITE (internally) in order to store the new configuration of a Sentinel after the internal state changes. In order to do so, we need configuration options (that usually the user will not touch at all) about config epoch of the master, Sentinels and Slaves known for this master, and so forth.	2013-11-18 16:03:03 +01:00
antirez	3a374b0511	Sentinel: failover abort function simplified.	2013-11-18 11:43:35 +01:00
antirez	e0750acf11	Sentinel: slaves reconfig delay modified. The time Sentinel waits since the slave is detected to be configured to the wrong master, before reconfiguring it, is now the failover_timeout time as this makes more sense in order to give the Sentinel performing the failover enoung time to reconfigure the slaves slowly (if required by the configuration). Also we now PUBLISH more frequently the new configuraiton as this allows to switch the reapprearing master back to slave faster.	2013-11-18 11:37:24 +01:00
antirez	83316f515c	Sentinel: failover restart time is now multiple of failover timeout. Also defaulf failover timeout changed to 3 minutes as the failover is a fairly fast procedure most of the times, unless there are a very big number of slaves and the user picked to configure them sequentially (in that case the user should change the failover timeout accordingly).	2013-11-18 11:30:08 +01:00
antirez	3a56013acb	Sentinel: state machine and timeouts simplified.	2013-11-18 11:12:58 +01:00
antirez	4be53b1c5d	Sentinel: election timeout define.	2013-11-18 10:08:06 +01:00
antirez	69d826a354	Sentinel: fix address of master in Hello messages. Once we switched configuration during a failover, we should advertise the new address. This was a serious race condition as the Sentinel performing the failover for a moment advertised the old address with the new configuration epoch: once trasmitted to the other Sentinels the broken configuration would remain there forever, until the next failover (because a greater configuration epoch is required to overwrite an older one).	2013-11-14 10:25:55 +01:00
antirez	e4c65e72c6	Sentinel: master address selection in get-master-address refactored.	2013-11-14 10:23:54 +01:00
antirez	c0d7229364	Sentinel: fix conditional to only affect slaves with wrong master.	2013-11-14 10:23:05 +01:00
antirez	dfbd9c5aeb	Sentinel: simplify and refactor slave reconfig code.	2013-11-14 00:36:43 +01:00
antirez	64ad6648a8	Sentinel: reconfigure slaves to right master.	2013-11-14 00:29:38 +01:00
antirez	3e27d678da	Sentinel: remember last time slave changed master.	2013-11-14 00:20:15 +01:00
antirez	8297745fa6	Sentinel: redirect-to-master is not ok with new algorithm. Now Sentinel believe the current configuration is always the winner and should be applied by Sentinels instead of trying to adapt our view of the cluster based on what we observe. So the only way to modify what a Sentinel believe to be the truth is to win an election and advertise the new configuration via Pub / Sub with a greater configuration epoch.	2013-11-13 17:03:48 +01:00
antirez	76a88f56e5	Sentinel: safer slave reconfig, master reported role should match.	2013-11-13 17:02:09 +01:00
antirez	ddaad9fe2d	Sentinel: role reporting fixed and added in SENTINEL output.	2013-11-13 16:39:57 +01:00
antirez	a0afa66f4b	Sentinel: being a master and reporting as slave is considered SDOWN.	2013-11-13 16:28:56 +01:00
antirez	17718fdcba	Sentinel: make sure role_reported is always updated.	2013-11-13 16:21:58 +01:00
antirez	46a053d34b	Sentinel: track role change time. Wait before reconfigurations.	2013-11-13 16:18:23 +01:00
antirez	9e40c46f5e	Sentinel: fix no-down check in master->slave conversion code.	2013-11-13 13:43:59 +01:00
antirez	ae35b7e240	Sentinel: readd slaves back after a master reset.	2013-11-13 13:01:11 +01:00
antirez	6bd4f6bffe	Sentinel: sentinelResetMaster() new flag to avoid removing set of sentinels. This commit also removes some dead code and cleanup generic flags.	2013-11-13 10:30:45 +01:00
antirez	1569af1f23	Sentinel: receive Pub/Sub messages from slaves.	2013-11-12 23:07:33 +01:00
antirez	dfa5f8b777	Sentinel: change event name when converting master to slave.	2013-11-12 23:00:17 +01:00
antirez	24158d1488	Sentinel: added config-epoch to SENTINEL masters output.	2013-11-12 17:22:04 +01:00
antirez	d2bc6dc39a	Sentinel: new failover algo, desync slaves and update config epoch.	2013-11-12 17:07:31 +01:00
antirez	4a128b949d	Sentinel: when starting failover seek for votes ASAP.	2013-11-12 16:38:02 +01:00
antirez	e6b9d5e97e	Sentinel: +new-epoch events.	2013-11-12 13:35:25 +01:00
antirez	54c447be52	Sentinel: wait some time between failover attempts.	2013-11-12 13:30:31 +01:00
antirez	ab4b2ec88f	Sentinel: allow to vote for myself.	2013-11-12 11:32:40 +01:00
antirez	b6b65b29c0	Sentinel: fix PUBLISH to masters and slaves.	2013-11-12 11:12:48 +01:00
antirez	90ab62fd5e	Sentinel: epoch introduced in leader vote.	2013-11-12 11:09:35 +01:00
antirez	8c1bf9a2bd	Sentinel: leadership handling changes WIP. Changes to leadership handling. Now the leader gets selected by every Sentinel, for a specified epoch, when the SENTINEL is-master-down-by-addr is sent. This command now includes the runid and the currentEpoch of the instance seeking for a vote. The Sentinel only votes a single time in a given epoch. Still a work in progress, does not even compile at this stage.	2013-11-11 18:30:14 +01:00
antirez	0bac36d0a1	Sentinel: handle Hello messages received via slaves correctly. Even when messages are received via the slave, we should perform operations (like adding a new Sentinel) in the context of the master.	2013-11-11 17:12:27 +01:00
antirez	9e1b27d49e	Sentinel: remove code not useful in the new design.	2013-11-11 12:06:11 +01:00
antirez	b93b0adc89	Sentinel: epoch introduced. Sentinel state now includes the idea of current epoch and config epoch. In the Hello message, that is now published both on masters and slaves, a Sentinel no longer just advertises itself but also broadcasts its current view of the configuration: the master name / ip / port and its current epoch. Sentinels receiving such information switch to the new master if the configuration epoch received is newer and the ip / port of the master are indeed different compared to the previos ones.	2013-11-11 11:05:58 +01:00
antirez	b2f834390c	Log to what master a slave is going to connect to.	2013-11-11 09:25:36 +01:00
antirez	a5e7358a12	Cluster: removed not needed newline at end of redisLog() msg.	2013-11-08 17:28:02 +01:00
antirez	28071caf38	Cluster: send a single UPDATE packet for now.	2013-11-08 17:25:49 +01:00
antirez	d289c628b1	Cluster: replace hardcoded 4096 for bus msg len with sizeof().	2013-11-08 17:19:19 +01:00
antirez	94a07d5901	Cluster: slots update refactored + UPDATE msg processing. Now there is a function that handles the update of the local slot configuration every time we have some new info about a node and its set of served slots and configEpoch. Moreoever the UPDATE packets are now processed when received (it was a work in progress in the previous commit).	2013-11-08 17:02:10 +01:00
antirez	dc43f66eac	Cluster: UPDATE msg data structure and sending function.	2013-11-08 16:26:50 +01:00
antirez	6c6572be95	Cluster: refactoring of slots update code and more. The commit also introduces detection of nodes publishing not updated configuration. More work in progress to send an UPDATE packet to inform of the config change.	2013-11-08 10:32:16 +01:00
antirez	d3588dc194	Fix broken rdbWriteRaw() return value check in rdb.c. Thanks to @PhoneLi for reporting.	2013-11-07 23:53:18 +01:00
antirez	d75ab87226	redis-trib: fixed slot allocation when --replicas is used.	2013-11-07 16:12:06 +01:00
antirez	80da056c29	Sentinel: sentinelSendSlaveOf() was missing a var and the prototype.	2013-11-06 11:23:53 +01:00
antirez	23800d9e49	Sentinel: increment pending_commands counter in two more places. AUTH and SCRIPT KILL were sent without incrementing the pending commands counter. Clearly this needs some kind of wrapper doing it for the caller in order to be less bug prone.	2013-11-06 11:21:44 +01:00
antirez	671c1dfb56	Sentinel: always send CONFIG REWRITE when changing instance role. This change makes Sentinel less fragile about a number of failure modes. This commit also fixes a different bug as a side effect, SLAVEOF command was sent multiple times without incrementing the pending commands count.	2013-11-06 11:13:27 +01:00
antirez	ebcb6251e6	SCAN code refactored to parse cursor first. The previous implementation of SCAN parsed the cursor in the generic function implementing SCAN, SSCAN, HSCAN and ZSCAN. The actual higher-level command implementation only checked for empty keys and return ASAP in that case. The result was that inverting the arguments of, for instance, SSCAN for example and write: SSCAN 0 key Instead of SSCAN key 0 Resulted into no error, since 0 is a non-existing key name very likely. Just the iterator returned no elements at all. In order to fix this issue the code was refactored to extract the function to parse the cursor and return the error. Every higher level command implementation now parses the cursor and later checks if the key exist or not.	2013-11-05 15:47:50 +01:00
antirez	b4048dfec0	SCAN: when iterating ziplists or intsets always return cursor of 0. The previous implementation assumed that the first call always happens with cursor set to 0, this may not be the case, and we want to return 0 anyway otherwise the (broken) client code will loop forever.	2013-11-05 15:32:25 +01:00
antirez	101d4bf867	Use strtoul() instead of sscanf() in SCAN implementation.	2013-11-05 15:30:21 +01:00
antirez	f56f78d159	HSCAN/ZSCAN: skip value when matching. This fixes issue #1360 and #1362.	2013-11-05 12:16:29 +01:00
antirez	1a0cea33a0	Cluster: initialize senderConfigEpoch and senderCurrentEpoch for warnings suppression.	2013-11-05 12:01:07 +01:00
antirez	eb95d28898	Pass int64_t to intsetGet() instead of long long.	2013-11-05 11:57:30 +01:00
antirez	20fb91fd31	removed not used vars in dictScan().	2013-11-05 11:56:11 +01:00
antirez	6cf230ea91	Initial support for --replicas in redis-trib.	2013-11-05 11:24:24 +01:00
antirez	c4ca5f99bf	Inverted variable boolean value and name after scanGenericCommand() refactoring.	2013-10-31 10:35:56 +01:00
antirez	8fc85a1218	scanGenericCommand() refactoring and handling of integer encoded elements. This commit fixes issue #1354.	2013-10-31 10:32:39 +01:00
antirez	3c7a0677b0	redis-benchmark: update help for new __rand_int__ form.	2013-10-28 18:13:42 +01:00
antirez	9b2d44e63f	Aesthetic fix (missing space) into HSCAN and ZSCAN implementations. Thanks to @badboy for reporting.	2013-10-28 13:20:11 +01:00
antirez	2c643ffa8d	ZSCAN implemented.	2013-10-28 11:36:42 +01:00
antirez	e50090aa06	HSCAN implemented.	2013-10-28 11:35:26 +01:00
antirez	4a1f1cc0d7	SSCAN implemented.	2013-10-28 11:17:32 +01:00
antirez	dfeaa84d46	dictScan(): empty hash table requires special handling.	2013-10-28 11:17:18 +01:00
antirez	cd8cb49dc4	SCAN is a random command and does not require output sorting. Sorting the output helps when we want to turn a non-deterministic into a deterministic command, in that case this is not possible.	2013-10-28 11:13:43 +01:00
antirez	442ae833b2	SCAN: refactored into scanGenericCommand. The new implementation is capable of iterating the keyspace but also sets, hashes, and sorted sets, and can be used to implement SSCAN, ZSCAN and HSCAN.	2013-10-28 11:11:34 +01:00
antirez	7bd45659b9	Fixed typos in dictScan() comment.	2013-10-25 17:05:55 +02:00
antirez	34c207227c	dictScan() algorithm documented.	2013-10-25 17:01:30 +02:00
antirez	48ea8a0a11	SCAN: stay inside 80 cols.	2013-10-25 12:01:49 +02:00
antirez	df13adb037	Revert "Fixed typo in SCAN comment. iff -> if." Probably here Pieter means "if and only if". This reverts commit `43fdf3b404`.	2013-10-25 12:00:13 +02:00
antirez	a25fe0b28d	SCAN: simplify keys list cleanup using listSetFreeMethod().	2013-10-25 11:58:03 +02:00
antirez	fd1b0ad07c	SCAN: improve variable names for readability.	2013-10-25 11:54:45 +02:00
antirez	908eba5a8f	SCAN: remove additional newlines to conform to Redis code base.	2013-10-25 11:51:08 +02:00
antirez	6e55e543c6	SCAN: remove useless assertion, already enforced by command table.	2013-10-25 11:49:08 +02:00
antirez	64722b0946	SCAN: use define REDIS_LONGSTR_SIZE instead of fixed len.	2013-10-25 11:48:18 +02:00
antirez	43fdf3b404	Fixed typo in SCAN comment. iff -> if.	2013-10-25 11:46:02 +02:00
antirez	9c88ace927	SCAN option name changed: pattern -> match.	2013-10-25 11:45:32 +02:00
Pieter Noordhuis	b63fbea5e4	Fix error in scan algorithm The irrelevant bits shouldn't be masked to 1. This can result in slots being skipped when the hash table is resized between calls to the iterator.	2013-10-25 10:50:03 +02:00
Pieter Noordhuis	7a6cfb18f3	SCAN requires at least 1 argument	2013-10-25 10:49:56 +02:00
Pieter Noordhuis	7f490b197f	Add SCAN command	2013-10-25 10:49:48 +02:00
antirez	85f52ebcd8	Cluster: rough support for sub-command options in redis-trib.	2013-10-11 17:33:19 +02:00
antirez	0c9f60a628	Cluster: there is a lower limit for the handshake timeout.	2013-10-11 10:34:32 +02:00
antirez	1447d28c0f	Cluster: data_age conversion to milliseconds fixed.	2013-10-09 16:36:06 +02:00
antirez	573c2fea91	Cluster: clusterCron() freq is now 10h. Still ping 1 node every sec. After the change in clusterCron() frequency of call, we still want to ping just one random node every second.	2013-10-09 16:29:17 +02:00
antirez	ba42428633	Cluster: time switched from seconds to milliseconds. All the internal state of cluster involving time is now using mstime_t and mstime() in order to use milliseconds resolution. Also the clusterCron() function is called with a 10 hz frequency instead of 1 hz. The cluster node_timeout must be also configured in milliseconds by the user in redis.conf.	2013-10-09 16:19:26 +02:00
antirez	929b6a4480	Cluster: cluster stuff moved from redis.h to cluster.h.	2013-10-09 15:38:05 +02:00
antirez	6fa9b1a420	Merge branch 'bettercluster' into unstable	2013-10-08 13:04:33 +02:00
antirez	ae2763f564	Cluster: masters don't vote for a slave with stale config. When a slave requests our vote, the configEpoch he claims for its master and the set of served slots must be greater or equal to the configEpoch of the nodes serving these slots in the current configuraiton of the master granting its vote. In other terms, masters don't vote for slaves having a stale configuration for the slots they want to serve.	2013-10-08 12:45:35 +02:00
antirez	f7d6ad4366	Cluster: fix slave data age computation when master is still connected.	2013-10-07 16:07:13 +02:00
antirez	2c3301b9f5	Cluster: log message improved when FAIL is cleared from a slave node.	2013-10-07 15:44:58 +02:00
antirez	72f38cd70f	Cluster: slave nodes advertise master slots bitmap and configEpoch.	2013-10-07 11:31:12 +02:00
antirez	0150c70b2b	Replication: install the write handler when reusing a cached master. Sometimes when we resurrect a cached master after a successful partial resynchronization attempt, there is pending data in the output buffers of the client structure representing the master (likely REPLCONF ACK commands). If we don't reinstall the write handler, it will never be installed again by addReply*() family functions as they'll assume that if there is already data pending, the write handler is already installed. This bug caused some slaves after a successful partial sync to never send REPLCONF ACK, and continuously being detected as timing out by the master, with a disconnection / reconnection loop.	2013-10-04 16:14:54 +02:00
antirez	1461422ce6	Replication: install the write handler when reusing a cached master. Sometimes when we resurrect a cached master after a successful partial resynchronization attempt, there is pending data in the output buffers of the client structure representing the master (likely REPLCONF ACK commands). If we don't reinstall the write handler, it will never be installed again by addReply*() family functions as they'll assume that if there is already data pending, the write handler is already installed. This bug caused some slaves after a successful partial sync to never send REPLCONF ACK, and continuously being detected as timing out by the master, with a disconnection / reconnection loop.	2013-10-04 16:12:25 +02:00
antirez	6d8c2a4848	Replication: fix master timeout. Since we started sending REPLCONF ACK from slaves to masters, the lastinteraction field of the client structure is always refreshed as soon as there is room in the socket output buffer, so masters in timeout are detected with too much delay (the socket buffer takes a lot of time to be filled by small REPLCONF ACK <number> entries). This commit only counts data received as interactions with a master, solving the issue.	2013-10-04 13:01:45 +02:00
antirez	b41570f719	Replication: fix master timeout. Since we started sending REPLCONF ACK from slaves to masters, the lastinteraction field of the client structure is always refreshed as soon as there is room in the socket output buffer, so masters in timeout are detected with too much delay (the socket buffer takes a lot of time to be filled by small REPLCONF ACK <number> entries). This commit only counts data received as interactions with a master, solving the issue.	2013-10-04 12:59:24 +02:00
antirez	d62ae1ec05	PSYNC: safer handling of PSYNC requests. There was a bug that over-esteemed the amount of backlog available, however this could only happen when a slave was asking for an offset that was in the "future" compared to the master replication backlog. Now this case is handled well and logged as an incident in the master log file.	2013-10-04 12:27:30 +02:00
antirez	4f9a69399b	Add REWRITE to CONFIG subcommands help message.	2013-10-04 12:27:26 +02:00
antirez	37e06bd952	PSYNC: safer handling of PSYNC requests. There was a bug that over-esteemed the amount of backlog available, however this could only happen when a slave was asking for an offset that was in the "future" compared to the master replication backlog. Now this case is handled well and logged as an incident in the master log file.	2013-10-04 12:25:09 +02:00
antirez	7afc0dd59a	Cluster: new clusterDoBeforeSleep() API. The new API is able to remember operations to perform before returning to the event loop, such as checking if there is the failover quorum for a slave, save and fsync the configuraiton file, and so forth. Because this operations are performed before returning on the event loop we are sure that messages that are sent in the same event loop run will be delivered after the configuration is already saved, that is a requirement sometimes. For instance we want to publish a new epoch only when it is already stored in nodes.conf in order to avoid returning back in the logical clock when a node is restarted. This new API provides a big performance advantage compared to saving and possibly fsyncing the configuration file multiple times in the same event loop run, especially in the case of big clusters with tens or hundreds of nodes.	2013-10-03 09:58:06 +02:00
antirez	211dcbe339	Cluster: update cluster config when slave changes master.	2013-10-02 12:27:12 +02:00
antirez	6c4d904baf	Cluster: bus messages stats in CLUSTER info.	2013-10-02 10:10:08 +02:00
antirez	abe81781ae	Cluster: FAIL messages from unknown senders are handled better. Previously the event was not logged but instead the node reported an unknown packet type received.	2013-10-02 09:42:45 +02:00
antirez	7970ebd80a	Cluster: senderCurrentEpoch == node currentEpoch was too strict. We can accept a vote as long as its epoch is >= the epoch at which we started the voting process. There is no need for it to be exactly the same.	2013-10-01 17:21:28 +02:00
antirez	f1bfd8233b	Cluster: fix typo in clusterProcessPacket() comment.	2013-10-01 15:40:20 +02:00
antirez	1dedf9aa36	Cluster: time field removed from cluster messages header. The new algorithm does not check replies time as checking for the currentEpoch in the reply ensures that the reply is about the current election process.	2013-09-30 16:19:44 +02:00
antirez	2b93a19537	Add REWRITE to CONFIG subcommands help message.	2013-09-30 11:53:18 +02:00
antirez	2d0844ee37	Cluster: log message shortened.	2013-09-30 11:51:58 +02:00
antirez	707ff0f714	Make clear that runids are not cluster node IDs.	2013-09-30 11:48:09 +02:00
antirez	4dc247eb31	Cluster: detect cluster reconfiguration when master slots drop to 0. The old algorithm used a PROMOTED flag and explicitly checks about slave->master convertions. Wit the new cluster meta-data propagation algorithm we just look at the configEpoch to check if we need to reconfigure slots, then: 1) If a node is a master but it reaches zero served slots becuase of reconfiguration. 2) If a node is a slave but the master reaches zero served slots because of a reconfiguration. We switch as a replica of the new slots owner.	2013-09-30 11:45:26 +02:00
antirez	62b1591439	Cluster: re-order failover operations to make it safer. We need to: 1) Increment the configEpoch. 2) Save it to disk and fsync the file. 3) Broadcast the PONG with the new configuration. If other nodes will receive the updated configuration we need to be sure to restart with this new config in the event of a crash.	2013-09-30 10:16:48 +02:00
antirez	b187517719	Cluster: when upading the configEpoch for a node, save config on disk ASAP.	2013-09-30 10:16:25 +02:00
antirez	03ca903983	Cluster: fsync data when saving the cluster config.	2013-09-30 10:13:07 +02:00
antirez	026e63392e	Cluster: update the node configEpoch when newer is detected.	2013-09-27 09:55:41 +02:00
antirez	7c4b8f29e7	Cluster: react faster when a slave wins an election.	2013-09-26 16:54:43 +02:00
antirez	42fa46e49a	Cluster: removed an old source of delay to start the slave failover.	2013-09-26 13:28:19 +02:00
antirez	a445aa30a0	Cluster: master node now uses new protocol to vote.	2013-09-26 13:00:41 +02:00
antirez	fb9b76fe14	Cluster: slave node now uses the new protocol to get elected.	2013-09-26 11:13:17 +02:00
Michel Martens	347ab78e90	Document the redis-cli --csv option.	2013-09-26 10:12:46 +02:00
antirez	656c3ffe4a	Cluster: fix redis-trib node config fingerprinting for new nodes format.	2013-09-25 12:58:06 +02:00
antirez	341ed1d1a8	Cluster: fix redis-trib for added configEpoch field in CLUSTER NODES.	2013-09-25 12:44:56 +02:00
antirez	32b5410af9	Cluster: add currentEpoch to CLUSTER INFO.	2013-09-25 12:38:36 +02:00
antirez	6ec795d2cf	Cluster: update our currentEpoch when a greater one is seen.	2013-09-25 12:36:29 +02:00
antirez	d426ada891	Cluster: broadcast currentEpoch and configEpoch in packets header.	2013-09-25 11:53:35 +02:00
antirez	12483b0061	Cluster: configEpoch added in cluster nodes description.	2013-09-25 11:47:13 +02:00
antirez	da257afe57	htonu64() and ntohu64 added to endianconv.h.	2013-09-25 09:26:36 +02:00
antirez	3c9bb8751a	Cluster: PFAIL -> FAIL transition allowed for slaves. First change: now there is no need to be a master in order to detect a failure, however the majority of masters signaling PFAIL or FAIL is needed. This change is important because it allows slaves rejoining the cluster after a partition to sense the FAIL condition so that eventually all the nodes agree on failures.	2013-09-20 11:26:44 +02:00

1 2 3 4 5 ...

2210 Commits