redis

Commit Graph

Author	SHA1	Message	Date
antirez	34c1871e9f	Cluster: set node role on successful handshake.	2013-03-25 13:03:01 +01:00
antirez	cda0cdfb70	redis-trib: Don't use colorization if TERM != xterm.	2013-03-25 12:51:53 +01:00
antirez	4286d8b979	redis-trib: initial output colorization	2013-03-25 12:50:38 +01:00
antirez	09aa55a334	redis-cli --stat, stolen from redis-tools. Redis-tools is a connection of tools no longer mantained that was intented as a way to economically make sense of Redis in the pre-vmware sponsorship era. However there was a nice redis-stat utility, this commit imports one of the functionalities of this tool here in redis-cli as it seems to be pretty useful. Usage: redis-cli --stat The output is similar to vmstat in the format, but with Redis specific stuff of course. From the point of view of the monitored instance, only INFO is used in order to grab data.	2013-03-22 17:54:32 +01:00
antirez	c195289e5e	redis-trib: All output wrapped by a specific function. This is needed in order to colorize it as next step. We use conventions in output messages such as >>> This is an action *** This is a warning [ERR] This is an error [OK] That's fine And so forth, so that a color will be associated checking the first three chars.	2013-03-22 17:39:43 +01:00
antirez	be7bdd376e	redis-trib: fix open slot correction. Slot zero was hardcoded (!)	2013-03-22 13:03:33 +01:00
antirez	813d7cbdd1	redis-trib: added cluster_error method to add errors.	2013-03-22 12:59:18 +01:00
antirez	d7eae8d8d7	redis-trib: fixed ClusterNode migrating/importing slots detection.	2013-03-22 12:54:04 +01:00
antirez	1baa153028	redis-trib: added some more output for check.	2013-03-22 12:47:49 +01:00
antirez	77bba91b48	redis-trib: fixed type has_flags? -> has_flag.	2013-03-22 12:28:06 +01:00
antirez	4dded0c187	redis-trib: ignore slaves when resharding.	2013-03-21 18:17:06 +01:00
antirez	47da76576e	redis-trib: fix conditional otherwise always true.	2013-03-21 17:22:14 +01:00
antirez	a89d435d8e	Cluster: move slotToKeyFlush() to emptyDb(). This way we are sure to destroy the slot->key map every time we destroy the DB, for instance when reloading a DB due to replication.	2013-03-21 17:13:08 +01:00
antirez	ed47f77977	redis-trib: initial support to fix "open" slots. Open slots are slots found in importing or migrating slot when a cluster check is performed.	2013-03-21 17:11:54 +01:00
antirez	1a6df1049d	redis-trib: load info about importing/migrating slots from node.	2013-03-21 16:31:53 +01:00
antirez	a8b09faf3d	Cluster: comment no longer in sync with code removed.	2013-03-21 10:47:10 +01:00
Johan Bergström	8080e1cfbe	use `install` as default installer (except on SunOS)	2013-03-21 13:32:08 +11:00
antirez	8c1bc8e865	Cluster: clear the PROMOTED slave directly into clusterSetMaster(). This way we make sure every time a master is turned into a replica the flag will be cleared.	2013-03-20 11:51:44 +01:00
antirez	e006407fd0	Cluster: master node must clear its hash slots when turning into a slave. When a master turns into a slave after a failover event, make sure to clear the assigned slots before setting up the replication, as a slave should never claim slots in an explicit way, but just take over the master slots when replacing its master.	2013-03-20 11:32:35 +01:00
antirez	506f9a42b0	Cluster: new flag PROMOTED introduced. A slave node set this flag for itself when, after receiving authorization from the majority of nodes, it turns itself into a master. At the same time now this flag is tested by nodes receiving a PING message before reconfiguring after a failover event. This makes the system more robust: even if currently there is no way to manually turn a slave into a master it is possible that we'll have such a feature in the future, or that simply because of misconfiguration a node joins the cluster as master while others believe it's a slave. This alone is now no longer enough to trigger reconfiguration as other nodes will check for the PROMOTED flag. The PROMOTED flag is cleared every time the node is turned back into a replica of some other node.	2013-03-20 10:48:42 +01:00
antirez	026b9483db	Cluster: add sender flags in cluster bus messages header. Sender flags were not propagated for the sender, but only for nodes in the gossip section. This is odd and in the next commits we'll need to get updated flags for the sender node, so this commit adds a new field in the cluster messages header. The message header is the same size as we reused some free space that was marked as 'unused' because of alignment concerns.	2013-03-20 10:32:00 +01:00
antirez	d15b027d91	Cluster: turn old master into a replica of node that failed over. So when the failing master node is back in touch with the cluster, instead of remaining unused it is converted into a replica of the new master, ready to perform the fail over if the new master node will fail at some point. Note that as a side effect clients with stale configuration are now not an issue as well, as the node converted into a slave will not accept queries but will redirect clients accordingly.	2013-03-20 00:30:47 +01:00
antirez	4d62623015	Cluster: node replication role change handle improved. The code handling a master that turns into a slave or the contrary was improved in order to avoid repeating the same operations. Also the readability and conceptual simplicity was improved.	2013-03-19 16:01:30 +01:00
antirez	88221f88c0	Cluster: new command CLUSTER FLUSHSLOTS. It's just a simpler way to CLUSTER DELSLOTS with all the slots as arguments, in order to obtain a node without assigned slots for reconfiguration.	2013-03-19 09:58:05 +01:00
antirez	7d3e32d526	redis-trib: don't load cluster config from nodes in FAIL state.	2013-03-19 09:46:12 +01:00
Johan Bergström	dc4003be71	Silence mkdir output	2013-03-17 18:37:38 +11:00
Johan Bergström	348be19b5f	Only pass -rdynamic as linker option	2013-03-17 17:49:57 +11:00
Johan Bergström	33a4bc2c70	Remove extra spaces	2013-03-17 17:23:45 +11:00
Johan Bergström	978c895b69	make check is a common naming convention for tests	2013-03-16 18:40:22 +11:00
Johan Bergström	ada7aa7ac9	Spaces to tabs	2013-03-16 18:35:20 +11:00
Johan Bergström	bea60bec75	Slightly refactor CFLAGS/LDFLAGS/LIBS This way, we can avoid -rdynamic and -pthread warnings on darwin.	2013-03-16 18:33:42 +11:00
antirez	e28e61e839	Cluster: when failing over claim master slots.	2013-03-15 16:53:41 +01:00
antirez	b8127e337a	Version incremented to 2.9.8 after major cluster progresses.	2013-03-15 16:45:45 +01:00
antirez	dd091661d4	Cluster: log when a slave asks for failover authorization.	2013-03-15 16:44:08 +01:00
antirez	1375b0611b	Cluster: slaves start failover with a small delay. Redis Cluster can cope with a minority of nodes not informed about the failure of a master in time for some reason (netsplit or node not functioning properly, blocked, ...) however to wait a few seconds before to start the failover will make most "normal" failovers simpler as the FAIL message will propagate before the slave election happens.	2013-03-15 16:39:49 +01:00
antirez	d512a09c20	Cluster: a bit more serious node role change handling.	2013-03-15 16:35:16 +01:00
antirez	004fbef847	Cluster: remove node from master slaves when it turns into a master. Also, a few nearby comments improved.	2013-03-15 16:16:19 +01:00
antirez	44c92f5aeb	Cluster: slave failover implemented.	2013-03-15 16:11:34 +01:00
antirez	1d8f302e0d	Cluster: election -> promotion in two comments.	2013-03-15 15:44:49 +01:00
antirez	bf82195467	Cluster: added function to broadcast pings. See the function top-comment for info why this is useful sometimes.	2013-03-15 15:43:58 +01:00
antirez	892e98548a	Cluster: don't broadcast messages to HANDSHAKE nodes. Also don't check for NOADDR as we check that node->link is not NULL that's enough.	2013-03-15 15:36:36 +01:00
antirez	76a3954f4a	Cluster: fix clusterHandleSlaveFailover() conditional: quorum is enough.	2013-03-15 13:20:34 +01:00
antirez	90e99a2082	Cluster: two lame bugs fixed in FAILOVER AUTH messages generation.	2013-03-14 21:27:12 +01:00
antirez	aeacaa57e6	Cluster: code to process messages moved in the right if-else chain.	2013-03-14 21:21:58 +01:00
antirez	35f05c66b6	Cluster: handle FAILOVER_AUTH_ACK messages. That's trivial as we just need to increment the count of masters that received with an ACK.	2013-03-14 16:43:13 +01:00
antirez	c2595500ac	Cluster: request failover authorization, log if we have quorum. However the failover is yet not really performed.	2013-03-14 16:39:02 +01:00
antirez	7fa42b801d	Cluster: clusterSendFailoverAuth() implementation.	2013-03-14 16:31:57 +01:00
NanXiao	79a13b46fb	Update config.c Fix bug in configGetCommand function: get correct masterauth value.	2013-03-14 13:52:43 +08:00
antirez	f59ff6fe61	Cluster: clusterSendFailoverAuthIfNeeded() work in progress.	2013-03-13 19:08:03 +01:00
antirez	44f6fdab60	Cluster: handle FAILOVER_AUTH_REQUEST in clusterProcessPacket(). However currently the control is passed to a function doing nothing at all.	2013-03-13 18:38:08 +01:00
antirez	ece95b2dea	Cluster: sanity check FAILOVER_AUTH_REQUEST messages for proper length.	2013-03-13 17:31:26 +01:00
antirez	66144337bf	Cluster: use 'else if' for mutually exclusive conditionals.	2013-03-13 17:27:06 +01:00
antirez	db7c17e969	Cluster: FAILOVER_AUTH_REQUEST message type introduced. This message is sent by a slave that is ready to failover its master to other nodes to get the authorization from the majority of masters.	2013-03-13 17:21:20 +01:00
antirez	575cbc9990	Cluster: clusterHandleSlaveFailover() stub.	2013-03-13 13:10:49 +01:00
antirez	1902a9c532	Replication: master_link_down_since_seconds initial value should be huge. server.repl_down_since used to be initialized to the current time at startup. This is wrong since the replication never started. Clients testing this filed to check if data is uptodate should never believe data is recent if we never ever connected to our master.	2013-03-13 12:54:48 +01:00
antirez	3d448bda39	Cluster: call clusterHandleSlaveFailover() when our master is down.	2013-03-13 12:44:02 +01:00
antirez	79a6844e44	rdbLoad(): rework code to save vertical space.	2013-03-12 19:46:33 +01:00
Salvatore Sanfilippo	9925c7c670	Merge pull request #1001 from djanowski/fatal-errors-rdb-load Abort when opening the RDB file results in an error other than ENOENT.	2013-03-12 11:40:36 -07:00
Damian Janowski	4178a80282	Abort when opening the RDB file results in an error other than ENOENT. This fixes cases where the RDB file does exist but can't be accessed for any reason. For instance, when the Redis process doesn't have enough permissions on the file.	2013-03-12 14:37:50 -03:00
antirez	215bfaea16	Set default for stop_writes_on_bgsave_err in initServerConfig(). It was placed for error in initServer() that's called after the configuation is already loaded, causing issue #1000.	2013-03-12 18:34:08 +01:00
antirez	91d3b487e7	redis-cli --bigkeys: don't crash with empty DBs.	2013-03-12 09:58:00 +01:00
antirez	2d851333a6	activeExpireCycle() smarter with many DBs and under expire pressure. activeExpireCycle() tries to test just a few DBs per iteration so that it scales if there are many configured DBs in the Redis instance. However this commit makes it a bit smarter when one a few of those DBs are under expiration pressure and there are many many keys to expire. What we do is to remember if in the last iteration had to return because we ran out of time. In that case the next iteration we'll test all the configured DBs so that we are sure we'll test again the DB under pressure. Before of this commit after some mass-expire in a given DB the function tested just a few of the next DBs, possibly empty, a few per iteration, so it took a long time for the function to reach again the DB under pressure. This resulted in a lot of memory being used by already expired keys and never accessed by clients.	2013-03-11 11:10:33 +01:00
antirez	08b107e405	In databasesCron() never test more DBs than we have.	2013-03-11 10:51:03 +01:00
antirez	4b1ccdfd49	Make comment name match var name in activeExpireCycle().	2013-03-11 10:42:14 +01:00
antirez	1f7d2c1e27	Optimize inner loop of activeExpireCycle() for no-expires case.	2013-03-09 11:48:54 +01:00
antirez	5f5aa487f9	REDIS_DBCRON_DBS_PER_SEC -> REDIS_DBCRON_DBS_PER_CALL	2013-03-09 11:44:20 +01:00
antirez	db29d71a30	activeExpireCycle(): process only a small number of DBs per iteration. This small number of DBs is set to 16 so actually in the default configuraiton Redis should behave exactly like in the past. However the difference is that when the user configures a very large number of DBs we don't do an O(N) operation, consuming a non trivial amount of CPU per serverCron() iteration.	2013-03-08 17:48:58 +01:00
antirez	40a2da159c	Use unsigned integers for DB ids, for defined wrap-to-zero.	2013-03-08 17:41:20 +01:00
antirez	7ac3b3a486	Only resize/rehash a few databases per cron iteration. This is the first step to lower the CPU usage when many databases are configured. The other is to also process a limited number of DBs per call in the active expire cycle.	2013-03-08 14:01:12 +01:00
antirez	dfd732dff3	Actually call databasesCron() inside serverCron().	2013-03-08 13:59:50 +01:00
antirez	cd9dcd1835	Move Redis databases background processing to databasesCron().	2013-03-08 12:34:05 +01:00
antirez	f0b807cd47	Cluster: update cluster state on PFAIL flag set/cleared on nodes.	2013-03-07 15:40:53 +01:00
antirez	299b8f76c2	Cluster: mark cluster state as fail of majority of masters is unreachable.	2013-03-07 15:36:59 +01:00
antirez	abf06fd5ff	Cluster: log global cluster state change.	2013-03-07 15:22:32 +01:00
antirez	3dad8196b7	Cluster: clusterUpdateState() function simplified. Also the NEEDHELP Cluster state was removed as it will no longer be used by Redis Cluster.	2013-03-06 18:25:40 +01:00
Gengliang Wang	042ed270c8	Removed useless "return" statements in pubsub.c (original commit message edited)	2013-03-06 16:49:20 +01:00
antirez	7b190a08cf	API to lookup commands with their original name. A new server.orig_commands table was added to the server structure, this contains a copy of the commant table unaffected by rename-command statements in redis.conf. A new API lookupCommandOrOriginal() was added that checks both tables, new first, old later, so that rewriteClientCommandVector() and friends can lookup commands with their new or original name in order to fix the client->cmd pointer when the argument vector is renamed. This fixes the segfault of issue #986, but does not fix a wider range of problems resulting from renaming commands that actually operate on data and are registered into the AOF file or propagated to slaves... That is command renaming should be handled with care.	2013-03-06 16:28:26 +01:00
antirez	bfa25441e7	Handle a non-impossible empty argv in loadServerConfigFromString(). Usually this does not happens since we trim for " \t\r\n", but if there are other chars that return true with isspace(), we may end with an empty argv. Better to handle the condition in an explicit way.	2013-03-06 12:40:48 +01:00
antirez	8c193af696	redis-cli: use sdsfreesplitres() instead of hand-coding it.	2013-03-06 12:38:32 +01:00
antirez	011fa89ac9	Cluster: sdssplitargs_free() -> sdsfreesplitres().	2013-03-06 12:38:06 +01:00
antirez	729a3432ba	sds.c: sdssplitargs_free() removed as it was a duplicate.	2013-03-06 12:38:06 +01:00
antirez	cf4d7737bb	More specific error message in loadServerConfigFromString().	2013-03-06 12:24:12 +01:00
antirez	4ea89e64c0	sdssplitargs(): on error set *argc to 0. This makes programs not checking the return value for NULL much safer since with this change: 1) It is still possible to iterate the zero-length result without crashes. 2) sdssplitargs_free will work against NULL and 0 count.	2013-03-06 12:21:31 +01:00
antirez	5cabae84e6	sdssplitargs(): now returns NULL only on error. An empty input string also resulted into the function returning NULL making it harder for the caller to distinguish between error and empty string without checking the original input string length.	2013-03-06 12:21:21 +01:00
charsyam	1303f02be6	Don't segfault on unbalanced quotes.	2013-03-06 11:54:02 +01:00
antirez	304ef5e283	Allow AUTH while loading the DB in memory. While Redis is loading the AOF or RDB file in memory only a subset of commands are allowed. This commit adds AUTH to this subset.	2013-03-06 11:50:38 +01:00
antirez	1025dd7786	Cluster: connect to our master ASAP after startup if we are a slave node.	2013-03-05 16:12:08 +01:00
antirez	bac57ad14b	Cluster: more robust FAIL flag cleaup. If we have a master in FAIL state that's reachable again, and apparently no one is going to serve its slots, clear the FAIL flag and let the cluster continue with its operations again.	2013-03-05 15:05:32 +01:00
antirez	1a02b7440a	Cluster: new node field fail_time. This is the unix time at which we set the FAIL flag for the node. It is only valid if FAIL is set. The idea is to use it in order to make the cluster more robust, for instance in order to revert a FAIL state if it is long-standing but still slots are assigned to this node, that is, no one is going to fix these slots apparently.	2013-03-05 13:15:05 +01:00
antirez	d3b4662347	Cluster: don't check keys hash slots when the source is our master. Usually we redirect clients to the right hash slot, however we don't want to do that with our master, we want just to mirror it.	2013-03-05 13:02:44 +01:00
antirez	31ac376051	Cluster: slaveof not allowed in redis.conf as well.	2013-03-05 12:58:22 +01:00
antirez	b7d085fc0d	Cluster: SLAVEOF command not allowed in cluster mode.	2013-03-05 12:39:41 +01:00
antirez	e4b481a5f6	Cluster: A comment updated in clusterCron().	2013-03-05 12:17:30 +01:00
antirez	d728ec6dee	Cluster: send a ping to every node we never contacted in timeout/2 seconds. Usually we try to send just 1 ping every second, however when we detect we are going to have unreliable failure detection because we can't ping some node in time, send an additional ping. This should only happen with very large clusters or when the the node timeout is set to a very low value.	2013-03-05 12:16:02 +01:00
antirez	e7628be2a7	Cluster: set node->slaveof correctly when a node state is updated.	2013-03-05 11:50:11 +01:00
antirez	d6457577d4	Cluster: don't perform startup slots sanity check for slaves. If we are a cluster node the DB content will not match our configured slots. Don't do the check at all.	2013-03-04 19:47:00 +01:00
antirez	d334897e80	Cluster: fix maximum line length when loading config. There are pathological cases where the line can be even longer a single node may contain all the slots in importing/migrating state.	2013-03-04 19:45:36 +01:00
antirez	3be893123f	Make sure replicationSetMaster() works when ip argument is not an sds.	2013-03-04 15:39:55 +01:00
antirez	b8a28bf442	Cluster: actually setup replication in CLUSTER REPLICATE.	2013-03-04 15:27:58 +01:00
antirez	7bead003e2	SLAVEOF command refactored into a proper API. We now have replicationSetMaster() and replicationUnsetMaster() that can be called in other contexts (for instance Redis Cluster).	2013-03-04 13:22:21 +01:00

1 2 3 4 5 ...

1749 Commits