vitalif/etcd - etcd

Commit Graph

Author	SHA1	Message	Date
Yicheng Qin	5d906a0acc	etcdserver: restore v3 storage when restart To load the previous data.	2015-09-29 00:14:27 -07:00
Yicheng Qin	939aa96a34	etcdmain: improve log when join discovery fails Before this PR, the log is ``` 2015/09/1 13:18:31 etcdmain: client: etcd cluster is unavailable or misconfigured ``` It is quite hard for people to understand what happens. Now we print out the exact reason for the failure, and explains the way to handle it.	2015-09-28 23:23:50 -07:00
Xiang Li	6c05a01ec6	Merge pull request #3604 from gyuho/replace_netutil_BasicAuth etcdhttp/auth: BasicAuth method in standard pkg	2015-09-28 15:55:46 -07:00
Gyu-Ho Lee	e16f81838b	etcdhttp/auth: BasicAuth method in standard pkg I created a new PR from https://github.com/coreos/etcd/pull/3598. This is for `TODO: use the standard lib BasicAuth method when we move to Go 1.4.` [1]. `BasicAuth` method got into Go standard package a year ago. [2] --- 1. https://github.com/coreos/etcd/blob/master/pkg/netutil/netutil.go#L126-L138 2. https://codereview.appspot.com/76540043/	2015-09-28 14:02:55 -07:00
Xiang Li	1226838381	etcdhttp: add Content-Type: application/json header to version handler	2015-09-25 15:14:13 -07:00
Gyu-Ho Lee	85f4475f62	httptypes/errors: HTTPError.WriteTo returns error Squashing all commits into this one (from https://github.com/coreos/etcd/pull/357). Thanks,	2015-09-25 08:06:26 -07:00
Xiang Li	2540a3fb7e	etcdsever: mismatch error uses the same format as the corresponding flags	2015-09-21 19:32:10 -07:00
Xiang Li	ea3dbfed60	Merge pull request #3408 from MSamman/extend-auth-api etcdserver: extend auth api	2015-09-21 11:51:19 -07:00
Xiang Li	3b70bf87c3	etcdmain: better logging when user forget to set initial flags	2015-09-21 10:43:26 -07:00
Mohammad Samman	6ae1f6c6e4	etcdserver: extend auth api allow recursive query on users and roles to get more detail Fixes #3278	2015-09-21 00:51:18 -07:00
Yicheng Qin	cedad49dcf	Merge pull request #3543 from mitake/reconfig-remove etcdserver: forbid removing started member if quorum cannot be preserved in strict reconfig mode	2015-09-17 18:22:53 -07:00
Hitoshi Mitake	f8859a980d	etcdserver: forbid removing started member if quorum cannot be preserved in strict reconfig mode Like the commit `6974fc63ed`, this commit lets etcdserver forbid removing started member if quorum cannot be preserved after reconfiguration if the option -strict-reconfig-check is passed to etcd. The removal can cause deadlock if unstarted members have wrong peer URLs.	2015-09-18 10:09:57 +09:00
Xiang Li	ec4142576e	Merge pull request #3534 from xiang90/grpc_err etcdserver: better v3 api error handling	2015-09-16 12:32:28 -07:00
Jonathan Boulle	7848ac3979	*: add missing license headers	2015-09-15 14:09:01 -07:00
Xiang Li	94f4069a25	etcdserver: better v3 api error handling	2015-09-15 11:20:06 -07:00
Yicheng Qin	352cd768c6	etcdserver: fix shadow declaration	2015-09-14 23:25:16 -07:00
Yicheng Qin	05c74bd890	etcdserver: rename db file into a formal directory and rename it to a formal name	2015-09-14 22:41:40 -07:00
Yicheng Qin	51f1ee055e	Merge pull request #3526 from yichengq/snapshot etcdserver: forbid to unset v3 demo once used	2015-09-14 21:36:39 -07:00
Yicheng Qin	1f0fb3d9aa	etcdserver: forbid to unset v3 demo once used After enabling v3 demo, it may change the underlying data organization for v3 store. So we forbid to unset --experimental-v3demo once it has been used.	2015-09-14 21:27:11 -07:00
Xiang Li	94f784826a	*: support v3 compaction	2015-09-14 19:59:36 -07:00
Xiang Li	e0d8923f7b	Merge pull request #3524 from xiang90/grpc_error etcdserver: use gRPC error instead of error message in header	2015-09-14 16:38:44 -07:00
Xiang Li	7183387110	etcdserver: use gRPC error instead of error message in header	2015-09-14 16:11:13 -07:00
Gyu-Ho Lee	c2dcf7431e	etcdserver, store: fix grammars in comments (a->an existing) I found some grammatical errors in comments. This pull request was submitted https://github.com/coreos/etcd/pull/3513. I am resubmitting following the correct guidlines.	2015-09-14 13:41:13 -07:00
Xiang Li	c7b4c67436	Merge pull request #3514 from xiang90/v3_raft support clustered v3 api	2015-09-14 09:35:02 -07:00
Xiang Li	4c81615cef	etcdserver: initial support for cluster-wide v3 request	2015-09-13 08:32:01 -07:00
Xiang Li	600456f4ba	etcdserverpb: update proto file for raftInternalRequest We needs to assign each raftInternalRequest an ID for getting the response after it goes through raft. We also needs an empty response for error case.	2015-09-13 08:28:10 -07:00
Hitoshi Mitake	dad32646eb	etcdserver: enhance test cases for isReadyToAddNewMember - a case of a cluster with even number members - a case of an empty cluster	2015-09-13 12:30:10 +09:00
Jonathan Boulle	d9cf752060	etcdserver: add test for isReadyToAddNewMember Also fixed check for special case of one-member cluster	2015-09-13 11:16:08 +09:00
Hitoshi Mitake	6974fc63ed	etcdserver: avoid deadlock caused by adding members with wrong peer URLs Current membership changing functionality of etcd seems to have a problem which can cause deadlock. How to produce: 1. construct N node cluster 2. add N new nodes with etcdctl member add, without starting the new members What happens: After finishing add N nodes, a total number of the cluster becomes 2 * N and a quorum number of the cluster becomes N + 1. It means membership change requires at least N + 1 nodes because Raft treats membership information in its log like other ordinal log append requests. Assume the peer URLs of the added nodes are wrong because of miss operation or bugs in wrapping program which launch etcd. In such a case, both of adding and removing members are impossible because the quorum isn't preserved. Of course ordinal requests cannot be served. The cluster would seem to be deadlock. Of course, the best practice of adding new nodes is adding one node and let the node start one by one. However, the effect of this problem is so serious. I think preventing the problem forcibly would be valuable. Solution: This patch lets etcd forbid adding a new node if the operation changes quorum and the number of changed quorum is larger than a number of running nodes. If etcd is launched with a newly added option -strict-reconfig-check, the checking logic is activated. If the option isn't passed, default behavior of reconfig is kept. Fixes https://github.com/coreos/etcd/issues/3477	2015-09-13 09:31:53 +09:00
Xiang Li	95d5556445	etcdserver: refactor v3demo do	2015-09-05 15:31:28 -07:00
Xiang Li	3f18ded10a	*: v3api index->revision	2015-09-04 10:41:20 -07:00
Xiang Li	2ac9af4924	*: replace consistent token with revision in v3 api	2015-09-03 15:41:33 -07:00
Xiang Li	ef7cf058a2	*: update gogoproto	2015-09-03 15:32:25 -07:00
Tamir Duberstein	45390b9fb8	*: regenerate proto to use local import path Using Go-style import paths in protos is not idiomatic. Normally, this detail would be internal to etcd, but the path from which gogoproto is imported affects downstream consumers (e.g. cockroachdb). In cockroach, we want to avoid including `$GOPATH/src` in our protoc include path for various reasons. This patch puts etcd on the same convention, which allows this for cockroach. More information: https://github.com/cockroachdb/cockroach/pull/2339#discussion_r38663417 This commit also regenerates all the protos, which seem to have drifted a tiny bit.	2015-09-03 13:38:28 -04:00
Xiang Li	a94118893c	Merge pull request #3413 from xiang90/snapshot_dir *: support wal dir	2015-09-01 10:03:50 -07:00
Xiang Li	d94e712d91	*: support wal dir	2015-09-01 09:54:27 -07:00
Yicheng Qin	f3bfcb9dee	etcdserver: add timeout param on getClusterFromRemotePeers It sets 10s timeout for public GetClusterFromRemotePeers. This helps the following cases to work well in high latency scenario: 1. proxy sync members from the cluster 2. newly-joined member sync members from the cluster Besides 10s request timeout, the request is also controlled by dial timeout and read connection timeout.	2015-09-01 08:49:01 -07:00
Xiang Li	1bcaa9f4a1	etcdserver: ignore confChangeUpdateNode in getIDs	2015-08-31 09:36:39 -07:00
Yicheng Qin	92cd24d5bd	*: fix govet shadow check failure	2015-08-27 14:15:30 -07:00
Yicheng Qin	8f6bf029f8	etcdserver: specify request timeout error due to connection lost It specifies request timeout error possibly caused by connection lost, and print out better log for user to understand. It handles two cases: 1. the leader cannot connect to majority of cluster. 2. the connection between follower and leader is down for a while, and it losts proposals. log format: ``` 20:04:19 etcd3 \| 2015-08-25 20:04:19.368126 E \| etcdhttp: etcdserver: request timed out, possibly due to connection lost 20:04:19 etcd3 \| 2015-08-25 20:04:19.368227 E \| etcdhttp: etcdserver: request timed out, possibly due to connection lost ```	2015-08-26 12:38:37 -07:00
Mohammad Samman	e2e002f94e	etcdserver: handle malformed basic auth return insufficient credentials if basic auth header is malformed Fixes #3280	2015-08-25 12:37:24 -07:00
Xiang Li	e3ef1d363a	Merge pull request #3366 from xiang90/v3_proto update v3 proto and doc	2015-08-24 11:22:29 -07:00
Xiang Li	1cccbb5ebd	etcdserverpb: add comments for compaction	2015-08-24 10:52:54 -07:00
Xiang Li	4a5b94478e	etcdserverpb: update comment for txn request	2015-08-24 10:40:05 -07:00
Xiang Li	98ceb3cdbd	etcdserverpb: add more field into rangeResponse	2015-08-24 10:33:20 -07:00
Cong Ding	c09b667d57	*: fix go vet reported issues	2015-08-22 12:19:02 -05:00
Xiang Li	044b23c3ca	Merge pull request #3356 from xiang90/travis *: test gofmt with -s and fix reported issues	2015-08-21 18:59:51 -07:00
Xiang Li	6b23a8131f	*: test gofmt with -s and fix reported issues	2015-08-21 18:52:16 -07:00
Yicheng Qin	8c0610d4f5	Merge pull request #3352 from yichengq/fix-name-url fix that etcd fails to start if using both IP and hostname when discovery srv	2015-08-21 12:38:38 -07:00
Yicheng Qin	72462a72fb	etcdserver: remove TODO to delete URLStringsEqual Discovery SRV supports to compare IP addresses with domain names, so we need URLStringsEqual function.	2015-08-21 09:52:17 -07:00
Yicheng Qin	8ea3d157c5	Revert "Revert "Treat URLs have same IP address as same"" This reverts commit `3153e635d5`. Conflicts: etcdserver/config.go	2015-08-21 09:41:13 -07:00
Xiang Li	11a689d063	etcdserver/auth: cache auth enable result	2015-08-20 23:05:00 -07:00
Yicheng Qin	bcb4d5d53e	Merge pull request #3311 from yichengq/request-timeout extend hardcoded timeout for globally-deployed etcd cluster	2015-08-17 17:00:24 -07:00
Yicheng Qin	1375ef8985	etcdserver: remove getVersion timeout The request can still time out because we have set dial timeout and read/write timeout. It increases timeout expectation from 1s to 5s, but it makes it workable in globally-deployer cluster.	2015-08-17 16:50:40 -07:00
Xiang Li	d487cf6b63	etcdhttp:write etcderror for all errors in keyhandler	2015-08-17 15:51:29 -07:00
Yicheng Qin	c530385d6d	Merge pull request #3313 from yichengq/internal-timeout etcdserver: use ReqTimeout only	2015-08-17 15:05:46 -07:00
Xiang Li	af6d1d3d95	Merge pull request #3310 from xiang90/http_err *: key handler should write auth error as etcd error	2015-08-17 14:57:19 -07:00
Yicheng Qin	2d5b95c49f	etcdserver: use ReqTimeout only We cannot refer RTT value from heartbeat interval, so CommitTimeout is invalid. Remove it and use ReqTimeout instead.	2015-08-17 14:54:25 -07:00
Xiang Li	87f061bab2	*: key handler should write auth error as etcd error	2015-08-17 14:45:45 -07:00
Xiang Li	15e03d801f	etcdserver: add version enforcement when setting cluster version	2015-08-17 11:12:39 -07:00
Xiang Li	f199a484af	*: only print out major.minor version for cluster version	2015-08-15 08:30:06 -07:00
Xiang Li	bbcb38189c	Merge pull request #3302 from xiang90/v etcdserver: better version detection log output	2015-08-14 16:14:55 -07:00
Xiang Li	0076ab154b	etcdserver: better version detection log output Fix https://github.com/coreos/etcd/issues/3288	2015-08-14 16:08:33 -07:00
Xiang Li	dd56b7e05e	Merge pull request #3299 from xiang90/txn initial support for txn	2015-08-14 16:05:16 -07:00
Xiang Li	9233fff48f	etcdserver: support txn	2015-08-14 11:45:31 -07:00
Xiang Li	46865fa5a5	etcdserverpb: update proto	2015-08-14 11:45:07 -07:00
Yicheng Qin	c229e6e655	etcdserver: improve error message when timeout due to leader fail	2015-08-13 15:46:21 -07:00
Yicheng Qin	ceb27b1c48	etcdhttp: add auth capability in 2.2	2015-08-13 14:49:10 -07:00
Yicheng Qin	0fdb77aea2	etcdserver: go back to marshal request in 2.1 way It fixes the problem that 2.1 cannot roll upgrade to 2.2 smoothly because 2.1 cannot understand the bytes marshalled at 2.2.	2015-08-13 13:41:52 -07:00
Yicheng Qin	27170e67b9	etcdserver: specify timeout caused by leader election Before this PR, the timeout caused by leader election returns: ``` 14:45:37 etcd2 \| 2015-08-12 14:45:37.786349 E \| etcdhttp: got unexpected response error (etcdserver: request timed out) ``` After this PR: ``` 15:52:54 etcd1 \| 2015-08-12 15:52:54.389523 E \| etcdhttp: etcdserver: request timed out, possibly due to leader down ```	2015-08-12 16:53:18 -07:00
Yicheng Qin	c3d4d11402	etcdhttp: adjust request timeout based on config It uses heartbeat interval and election timeout to estimate the expected request timeout. This PR helps etcd survive under high roundtrip-time environment, e.g., globally-deployed cluster.	2015-08-12 09:22:59 -07:00
Yicheng Qin	5a91937367	etcdserver: adjust commit timeout based on config It uses heartbeat interval and election timeout to estimate the commit timeout for internal requests. This PR helps etcd survive under high roundtrip-time environment, e.g., globally-deployed cluster.	2015-08-11 21:09:03 -07:00
Xiang Li	a718329ad3	Merge pull request #3248 from xiang90/v3 initial v3 demo	2015-08-10 13:59:03 -07:00
Brandon Philips	fb1951204c	etcdserver: move atomics to make etcd work on arm64 Follow the simple rule in the atomic package: "On both ARM and x86-32, it is the caller's responsibility to arrange for 64-bit alignment of 64-bit words accessed atomically. The first word in a global variable or in an allocated struct or slice can be relied upon to be 64-bit aligned." Tested on a system with /proc/cpuinfo reporting: processor : 0 model name : ARMv7 Processor rev 1 (v7l) Features : swp half thumb fastmult vfp edsp thumbee neon vfpv3 tls vfpv4 idiva idivt vfpd32 lpae evtstrm CPU implementer : 0x41 CPU architecture: 7 CPU variant : 0x0 CPU part : 0xc0d CPU revision : 1	2015-08-08 18:11:41 -07:00
Xiang Li	9ff7075ce8	etcdserver: use v3server interface	2015-08-08 10:39:04 -07:00
Xiang Li	f004b4dac7	*: etcdserver supports v3 demo	2015-08-08 05:58:29 -07:00
Xiang Li	82afadbcc6	etcdserverpb: update proto	2015-08-08 05:31:35 -07:00
Xiang Li	845c51fedd	*: fix typos vaild->valid	2015-08-07 10:57:11 -07:00
Yicheng Qin	f03f048232	Merge pull request #3184 from yichengq/fast-bootstrap etcdserver: tick ElectionTicks before starting when bootstrap new cluster	2015-08-06 15:54:40 -07:00
Yicheng Qin	21f5b885f2	etcdserver: fast election timeout when bootstrap cluster The behavior accelarates the happen of the first-time leader election, so the cluster could elect its leader fast. Technically, it could help to reduce `electionMs - heartbeatMs` wait time for the first leader election. Main usage: 1. Quick start for the local cluster when setting a little longer election timeout 2. Quick start for the global cluster, which sets election timeout to its maximum 50s.	2015-08-06 15:44:26 -07:00
Yicheng Qin	a637e86372	Merge pull request #3220 from yichengq/fix-auth-check etcdhttp: fix access check for multiple roles in auth	2015-08-06 15:09:04 -07:00
Xiang Li	58503817ec	etcdserver: internal request union	2015-08-05 07:47:10 -07:00
Yicheng Qin	18169e896c	etcdhttp: fix access check for multiple roles in auth Check access for multiple roles should go through all roles.	2015-08-04 14:31:07 -07:00
Xiang Li	2b8abeb093	*: remove migration related stuff from 2.2	2015-08-01 19:37:20 +08:00
Barak Michener	dd1a8fe330	etcdhttp: Improve test coverage surrounding auth	2015-07-30 14:21:08 -04:00
Xiang Li	80b794dccc	Merge pull request #3185 from xiang90/add_debug_endpoint etcdhttp: add config/local/debug endpoint	2015-07-30 08:46:07 +08:00
Xiang Li	4e31df2c2b	etcdhttp: add config/local/log endpoint PUT on the endpoint sets the GlobalDebugLevel to json level value. The action overwrites the origianl log level setting from users. We need to write doc to warn this.	2015-07-30 08:35:01 +08:00
Yicheng Qin	6fc9dbfe56	Merge pull request #3114 from yichengq/clean-raft-init etcdserver: clean up start and stop logic of raft	2015-07-27 14:19:25 -07:00
Yicheng Qin	7696dd3280	etcdserver: clean up start and stop logic of raft kill TODO and make it more readable.	2015-07-27 13:24:26 -07:00
Xiang Li	53a77fa519	*: tnx -> txn	2015-07-24 23:21:09 +08:00
Yicheng Qin	b7892b20c1	etcdserver: rename defaultPublishRetryInterval -> defaultPublishTimeout This makes code more readable and reasonable.	2015-07-23 10:09:28 -07:00
Yicheng Qin	5be545b872	Merge pull request #3077 from yichengq/fix-test-sync etcdserver: init raft internal var early	2015-07-10 14:44:52 -07:00
Xiang Li	2fb8347d36	etcdserver: add rpc proto	2015-06-29 20:00:09 -07:00
Xiang Li	581ef05bab	*: resolve proto warnings	2015-06-29 18:39:46 -07:00
Xiang Li	13f44e4b79	*: update generated proto code	2015-06-29 16:45:25 -07:00
Yicheng Qin	7f95780bfb	etcdserver: init raft internal var early Its `stopped`/`done` should be created always before being used in defer in server loop. It fixes the race detected when running TestSyncTrigger.	2015-06-29 15:34:15 -07:00
Yicheng Qin	2e41b4f9e1	etcdserver/auth: fix return value when creating root user Before: ``` $ curl http://127.0.0.1:4001/v2/auth/users/root -XPUT -d '{"user": "root", "password": "root"}' {"user":"root","roles":null} ``` After: ``` {"user":"root","roles":["root"]} ```	2015-06-27 23:16:54 -07:00
Barak Michener	acca9cc3a9	Merge pull request #3047 from barakmich/auth_cov auth: improve test coverage	2015-06-25 14:47:22 -04:00
Barak Michener	39c10d1fe4	auth: improve test coverage	2015-06-25 14:25:08 -04:00
Yicheng Qin	5d131acfba	etcdserver: fix TestTriggerSnap Before checking, it needs to wait for snapshot goroutine to finish its work.	2015-06-25 09:58:36 -07:00
Xiang Li	52c2a5731f	etcdserver: fix typo in metrics.go	2015-06-24 12:42:40 -07:00
Xiang Li	030d1bbf2d	auth: do not allow update root role	2015-06-23 20:15:08 -07:00
Xiang Li	e291dfd748	etcdhttp: improve user endpoint validation Giving both roles and grant/revoke is not allowed. Creating an existing user is not allowed. Updating a non-existing user is not allowed.	2015-06-23 14:38:44 -07:00
Xiang Li	c8628c8fe5	auth: separate the role create and update path Giving both permission and grant/revoke is not allowed. Creating an existing role is not allowed. Updating a non-existing is not allowed.	2015-06-23 13:15:32 -07:00
Xiang Li	bc61056912	etcdhttp: use correct http status const when writing http error	2015-06-23 12:40:30 -07:00
Xiang Li	4f47a6ebfb	Merge pull request #3032 from xiang90/refactor_update_role auth: refactor updateRole	2015-06-23 11:17:45 -07:00
Barak Michener	d5a0e3ac6a	etcdhttp: Always strip password hash when returning users	2015-06-22 18:39:16 -04:00
Xiang Li	979f531261	auth: refactor updateRole We will return error if revoke or grant fails to update the role. No need to check if revoke or grant is nil or not.	2015-06-22 15:16:10 -07:00
Xiang Li	3f82e7b116	auth: do not allow to grant duplicate role or revoke ungranted role to a user	2015-06-22 15:11:09 -07:00
Barak Michener	51a65599dd	Merge pull request #3021 from xiang90/auth_err etcdserver: use correct http status code for auth error	2015-06-22 14:58:33 -04:00
Xiang Li	c39aad0e92	etcdserver: use correct http status code for auth error	2015-06-22 09:28:47 -07:00
Xiang Li	3e4479b0cd	Merge pull request #3022 from xiang90/aut_type etcdhttp: fix the response type for auth	2015-06-21 15:06:35 -07:00
Xiang Li	d295d21349	etcdserver: better log message for url mismatch	2015-06-19 19:36:26 -07:00
Xiang Li	cad757efa0	etcdhttp: fix the response type for auth	2015-06-19 15:19:00 -07:00
Barak Michener	64ec8af91b	*: Rename `security` to `auth`	2015-06-15 18:18:50 -04:00
Antoine Grondin	270487d340	etcdserver: use Infof to print formatted argument	2015-06-14 20:22:21 +07:00
Xiang Li	8ad7ed321e	*:godep log pkg	2015-06-11 14:22:14 -07:00
Xiang Li	f013a627a4	etcdserver/stats: use leveled log	2015-06-11 14:22:14 -07:00
Xiang Li	cf7cb2b8a9	etcdserver/security: use leveled log	2015-06-11 14:22:14 -07:00
Xiang Li	2f795e42d0	httptypes: use leveled log	2015-06-11 14:19:53 -07:00
Barak Michener	7bf0479e66	Merge pull request #2882 from barakmich/security_client_new *: Add security/authorization to etcd/client and etcdctl	2015-06-11 13:40:32 -04:00
Yicheng Qin	1af2b4cad7	rafthttp: fix TestUpdateMember Before this PR, it may error like this: ``` --- FAIL: TestUpdateMember-2 (0.00s) server_test.go:950: action = [{ApplyConfChange:ConfChangeUpdateNode []} {ProposeConfChange:ConfChangeUpdateNode []}], want [{ProposeConfChange:ConfChangeUpdateNode []} {ApplyConfChange:ConfChangeUpdateNode []}] ``` This fixes the test by recording the proposal event in time.	2015-06-11 09:45:34 -07:00
Yicheng Qin	cd629c9b44	Merge pull request #2939 from yichengq/fix-update-attr etcdserver: allow to update attributes of removed member	2015-06-10 16:53:39 -07:00
Yicheng Qin	8725e69cf7	etcdserver: allow to update attributes of removed member There exist the possiblity to update attributes of removed member in reasonable workflow: 1. start member A 2. leader receives the proposal to remove member A 2. member A sends the proposal of update its attribute to the leader 3. leader commits the two proposals So etcdserver should allow to update attributes of removed member.	2015-06-10 16:52:18 -07:00
Yicheng Qin	4e79abcfeb	Merge pull request #2944 from yichengq/fix-2procs pkg/testutil: ForceGosched -> WaitSchedule	2015-06-10 14:44:32 -07:00
Yicheng Qin	018fb8e6d9	pkg/testutil: ForceGosched -> WaitSchedule ForceGosched() performs bad when GOMAXPROCS>1. When GOMAXPROCS=1, it could promise that other goroutines run long enough because it always yield the processor to other goroutines. But it cannot yield processor to goroutine running on other processors. So when GOMAXPROCS>1, the yield may finish when goroutine on the other processor just runs for little time. Here is a test to confirm the case: ``` package main import ( "fmt" "runtime" "testing" ) func ForceGosched() { // possibility enough to sched up to 10 go routines. for i := 0; i < 10000; i++ { runtime.Gosched() } } var d int func loop(c chan struct{}) { for { select { case <-c: for i := 0; i < 1000; i++ { fmt.Sprintf("come to time %d", i) } d++ } } } func TestLoop(t *testing.T) { c := make(chan struct{}, 1) go loop(c) c <- struct{}{} ForceGosched() if d != 1 { t.Fatal("d is not incremented") } } ``` `go test -v -race` runs well, but `GOMAXPROCS=2 go test -v -race` fails. Change the functionality to waiting for schedule to happen.	2015-06-10 14:37:41 -07:00
Barak Michener	a4d1a5a6e5	*: Add security/auth support to etcdctl and etcd/client add godep for speakeasy and auth entry parsing add security_user to client add role to client add role commands add auth support to etcdclient and etcdctl(member/user) add enable/disable to etcdctl better error messages, read/write/readwrite Bump go-etcd to include codec changes, add new dependency verify the error for revoke/add if nothing changed, remove security-merging prefix	2015-06-10 16:58:10 -04:00
Xiang Li	19ef3a0982	Merge pull request #2934 from xiang90/etcdserver_log etcdserver: use leveled logging	2015-06-09 15:53:52 -07:00
Xiang Li	e0f9796653	etcdserver: use leveled logging Leveled logging for etcdserver pkg.	2015-06-09 13:53:07 -07:00
Yicheng Qin	9fbd2599ad	Merge pull request #2940 from yichengq/improve-raft-loop etcdserver: stop raft loop when receiving stop signal	2015-06-09 11:24:53 -07:00
Yicheng Qin	0814966ca2	etcdserver: stop raft loop when receiving stop signal When it waits for apply to be done, it should stop the loop if it receives stop signal. This helps to print out panic information. Before this PR, if the panic happens when server loop is applying entries, server loop will wait for raft loop to stop forever.	2015-06-09 11:11:53 -07:00
Brian Akins	d8a836e618	Simple debug HTTP request logging	2015-06-09 13:40:37 -04:00
Xiang Li	0adeee2965	etcdhttp: use leveled logging	2015-06-09 09:26:57 -07:00
Xiang Li	3af4a45d7b	etcdserver: make raft use leveled logger	2015-06-02 12:50:42 -07:00
Xiang Li	42fe370b35	Merge pull request #2848 from xiang90/metrics *: use namespace and subsystem in metrics	2015-05-26 14:44:54 -07:00
Xiang Li	34ac145b38	*: use namespace and subsystem in metrics Fix #2841. From Prometheus developer: ``` the recommended way for etcd as an open source project and under consideration of its size would be etcd_<subsystem>_<name>. ``` We made the naming change accordingly.	2015-05-26 14:39:04 -07:00
Xiang Li	3028edd7dc	Merge pull request #2856 from xiang90/mrefactor etcdserver: refactore member.go	2015-05-26 14:37:37 -07:00
Barak Michener	9ef098c5ed	etcdserver: fix go vet. Fixes #2859	2015-05-22 13:54:54 -04:00
Xiang Li	58eefda72d	Merge pull request #2840 from yichengq/revert-url-equal Revert "Treat URLs have same IP address as same"	2015-05-21 19:27:19 -07:00
Xiang Li	4a72d3a8bb	etcdserver: refactore member.go	2015-05-21 09:19:29 -07:00
Xiang Li	260aad5468	Merge pull request #2830 from xiang90/join_checking checking cluster version compatibility before joining the existing cluster	2015-05-20 12:25:50 -07:00
Xiang Li	aa417ab644	etcdserver: log the per endpoint error in getVersion	2015-05-20 12:10:10 -07:00
Xiang Li	db7db689a6	etcdserver: check cluster version compability when joining	2015-05-19 10:19:41 -07:00
Barak Michener	a88a53274f	security: Lazily create the security directories. Fixes #2755 , may find new instances for #2741 revert the kv integration test fix nits amend security mention of GUEST	2015-05-18 17:28:04 -04:00
Yicheng Qin	3153e635d5	Revert "Treat URLs have same IP address as same" This reverts commit `f8ce5996b0`. etcd no longer resolves TCP addresses passed in through flags, so there is no need to compare hostname and IP slices anymore. (for more details: `a3892221ee`) Conflicts: etcdserver/cluster.go etcdserver/config.go pkg/netutil/netutil.go pkg/netutil/netutil_test.go	2015-05-16 03:21:10 -07:00
Xiang Li	9f8342dba4	etcdserver: do not get local version via HTTP	2015-05-13 17:19:32 -07:00
Xiang Li	988c30bfba	etcdserver: getVersion returns both server and cluster version	2015-05-13 17:04:46 -07:00
Xiang Li	6296054ff6	etcdhttp: version endpoint also returns cluster version.	2015-05-13 15:48:10 -07:00
Yicheng Qin	75ee7f4aa1	Merge pull request #2821 from yichengq/private-cluster etcdserver: stop exposing Cluster struct	2015-05-13 10:26:48 -07:00
Xiang Li	2690535f8a	Merge pull request #2820 from xiang90/cap version capability checking	2015-05-13 10:16:49 -07:00
Xiang Li	d3b1d5c008	etcdhttp: support capability checking etcdhttp will check the cluster version and update its capability version periodically. Any new handler's after 2.0 needs to wrap by capability handler to ensure it is not accessable until rolling upgrade finished.	2015-05-13 10:11:35 -07:00
Yicheng Qin	a6a649f1c3	etcdserver: stop exposing Cluster struct After this PR, only cluster's interface Cluster is exposed, which makes code much cleaner. And it avoids external packages to rely on cluster struct in the future.	2015-05-13 10:01:25 -07:00
Xiang Li	f2905f2828	etcdserver: remove unnecessary around detect datadir The log is super unhelpful. When I have a 2.1.0 etcd, it prints out `2.0.1 vaild dir`. I have no idea why the data dir of a 2.1.0 etcd is 2.0.1.	2015-05-12 22:06:42 -07:00
Yicheng Qin	032db5e396	*: extract types.Cluster from etcdserver.Cluster The PR extracts types.Cluster from etcdserver.Cluster. types.Cluster is used for flag parsing and etcdserver config. There is no need to expose etcdserver.Cluster public, which contains lots of etcdserver internal details and methods. This is the first step for it.	2015-05-12 14:53:11 -07:00
Xiang Li	e866314b94	etcdserver: support update cluster version through raft 1. Persist the cluster version change through raft. When the member is restarted, it can recover the previous known decided cluster version. 2. When there is a new leader, it is forced to do a version checking immediately. This helps to update the first cluster version fast.	2015-05-12 11:44:34 -07:00
Xiang Li	94ffd72c7e	etcdserver: rename StoreAdminPrefix to StoreClusterPrefix We store cluster related key in StoreAdminPrefix for some historical reason. The previous API is called admin. But now, the admin name is gone and `cluster` is a more clear and correct name.	2015-04-29 12:05:51 -07:00
Xiang Li	6699107f61	*: add cluster version and cluster version detection. Cluster version is the min major.minor of all members in the etcd cluster. Cluster version is set to the min version that a etcd member is compatible with when first bootstrapp. During a rolling upgrades, the cluster version will be updated automatically. For example: ``` Cluster [a:1, b:1 ,c:1] -> clusterVersion 1 update a -> 2, b -> 2 after a detection Cluster [a:2, b:2 ,c:1] -> clusterVersion 1, since c is still 1 update c -> 2 after a detection Cluster [a:2, b:2 ,c:2] -> clusterVersion 2 ``` The API/raft component can utilize clusterVersion to determine if it can accept a client request or a raft RPC. We choose polling rather than pushing since we want to use the same logic for cluster version detection and (TODO) cluster version checking. Before a member actually joins a etcd cluster, it should check the version of the cluster. Push does not work since the other members cannot push version info to it before it actually joins. Moreover, we do not want our raft RPC system (which is doing the heartbeat pushing) to coordinate cluster version.	2015-04-29 11:31:59 -07:00
Yicheng Qin	1c1cccd236	rafthttp: stop etcd if it is found removed when stream dial The original process is stopping etcd only when pipeline message finds itself has been removed. After this PR, stream dial has this functionality too. It helps fast etcd stop, which doesn't need to wait for stream break to fall back to pipeline, and wait for election timeout to send out message to detect self removal.	2015-04-27 15:10:00 -07:00
Yicheng Qin	ebecee34e0	Merge pull request #2701 from yichengq/rafthttp-anon rafthttp: add remotes	2015-04-24 13:04:37 -07:00
Yicheng Qin	9f19b5660f	rafthttp: add AddRemote Add remotes to rafthttp, who help newly joined members catch up the progress of the cluster. It supports basic message sending to remote, and has no stream connection for simplicity. remotes will not be used after the latest peers have been added into rafthttp.	2015-04-24 11:49:23 -07:00
xiaost	cab1e9a723	etcdserver: skip noop entry in apply	2015-04-24 12:15:51 +08:00
Barak Michener	fa74e702d8	security: Improve the security api as per the suggestions list in #2384 Subcommits: decouple root and security enable/disable create root role prefix matching godep: bump go-etcd to include credentials add godep for speakeasy and auth entry parsing appropriate errors for security enable/disable WIP adding to etcd/client all the security client methods add guest access minor ui return tweaks revert client changes respond to comments, log more security operations fix major ensure() bug, add better UX block recursive access fix some boneheaded mistakes fix integration test last comments fix up security_api.md philips nits fix docs	2015-04-23 16:11:38 -04:00
Yicheng Qin	1d96de459a	etcdserver: init server stats before passing it as argument It is more reasonable to init the variable before passing it as an argument. It fixes a bug that etcdserver may panic on server stats when processing a message from rafthttp streamReader before server stats is initialized in server.Start().	2015-04-22 08:28:08 -07:00
Xiang Li	5ad559b503	*: serve json version on both client and peer url	2015-04-20 16:23:51 -07:00
Yicheng Qin	1811701427	Revert "etcdserver: fix cluster fallback recovery" This reverts commit `cff005777a`. Conflicts: etcdserver/server.go	2015-04-19 11:34:33 -07:00
Yicheng Qin	88224f6f4e	Revert "etcdserver: not apply stale conf change in cluster and transport" This reverts commit `40197f0698`.	2015-04-19 11:08:03 -07:00
Xiang Li	98f8dfbc9d	etcdserver: prevExist=true + condition is compareAndSwap PrevExist indicates the key should exist. Condition compares with an existing key. So PrevExist+condition = CompareAndSwap not Update.	2015-04-14 23:44:06 -07:00
xiaost	eab2c2224a	etcdserver: fix minor bug in EtcdServer.send it seems to nothing serious. after deleted peers, the log may output: "etcdserver: send message to unknown receiver %s"	2015-04-13 20:35:58 +08:00
Yicheng Qin	2141308524	Merge pull request #2631 from yichengq/metrics-fd etcdserver: metrics and monitor number of file descriptor	2015-04-08 11:28:58 -07:00
Yicheng Qin	7a7e1f7a7c	etcdserver: metrics and monitor number of file descriptor It exposes the metrics of file descriptor limit and file descriptor used. Moreover, it prints out warning when more than 80% of fd limit has been used. ``` 2015/04/08 01:26:19 etcdserver: 80% of the file descriptor limit is open [open = 969, limit = 1024] ```	2015-04-08 11:17:48 -07:00
Alex Crawford	d9ad6aa2a9	*: update to use IANA-assigned ports	2015-04-06 13:49:43 -07:00
Xiang Li	471aa1aa89	Merge pull request #2622 from xiang90/fix_watcher store: fix watcher removal	2015-04-03 10:39:03 -07:00
Xiang Li	999917010d	store: fix watcher removal	2015-04-03 10:13:43 -07:00
Yicheng Qin	9e5743c816	etcdserver: stop raft node goroutine before stop server Stop raftNode goroutine before stopping server goroutine, so server.Stop does stop all underlying stuffs elegantly now. This fixes the problem that previous-round lock on WAL may not be released when etcd is restarted.	2015-04-01 11:20:51 -07:00
Xiang Li	77a04cda0c	Merge pull request #2597 from xiang90/wal-repair wal: fix the unexpectedEOF error in the last wal.	2015-03-30 13:49:05 -07:00
Xiang Li	253f7c4ae1	Merge pull request #2522 from xiang90/user_pw etcdserver/etcdhttp: do not return back the password of a user	2015-03-30 13:42:41 -07:00
Xiang Li	0b9a318e68	etcdserver: make the wal repairing logic clear	2015-03-29 21:10:28 -07:00
Xiang Li	1231f82f22	etcdserver: save snapshot into wal first	2015-03-29 14:23:05 -07:00
Xiang Li	8b4eed29e5	wal: fix the unexpectedEOF error in the last wal. It is safe to repair the unexpectedEOF error in the last wal. raft will not send out message before the entry successfully comitted into wal. Thus we can safely truncate the last entry in the wal to repair.	2015-03-28 21:08:14 -07:00
Yicheng Qin	60efd4d96e	Revert "etcdhttp: add internalVersion" This reverts commit `a77bf97c14`. Conflicts: version/version.go	2015-03-27 16:53:55 -07:00
Yicheng Qin	dd92a2b484	Merge pull request #2556 from yichengq/fix-apply-conf etcdserver: not apply stale conf change	2015-03-27 14:00:30 -07:00
Kelsey Hightower	538d624cfa	etcdserver: add stats.LatencyStats and stats.CountsStats types	2015-03-27 13:42:44 -07:00
Yicheng Qin	40197f0698	etcdserver: not apply stale conf change in cluster and transport	2015-03-27 12:53:34 -07:00
Xiang Li	e3817adb5b	etcdserver: loose member validation for joining existing cluster	2015-03-25 13:59:22 -07:00
Xiang Li	05e240b892	*: update protobuf	2015-03-25 10:14:35 -07:00
Yicheng Qin	5e0077cc0c	etcdserver: print out extra files in data dir instead of erroring	2015-03-24 18:56:22 -07:00
Xiang Li	866a9d4e41	Merge pull request #2568 from xiang90/raftnode raft: make node configurable	2015-03-24 11:18:22 -07:00
Yicheng Qin	ea78f5d1aa	Merge pull request #2552 from yichengq/fix-2396 etcdserver: check -initial-cluster in join case	2015-03-23 22:46:38 -07:00
Yicheng Qin	abcd828114	etcdserver: add join-existing check	2015-03-23 22:31:20 -07:00
Xiang Li	abddef0f28	raft: make node configurable	2015-03-23 21:20:49 -07:00
Kelsey Hightower	4611c3b2d7	netutil: add BasicAuth function etcd ships it's own BasicAuth function and no longer requires Go 1.4 to build.	2015-03-20 17:32:33 -07:00
Xiang Li	9d28f94005	etcdserver/etcdhttp: do not return back the password of a user	2015-03-16 22:35:01 -07:00
Xiang Li	f3e4dbf967	etcdserver/etcdhttp: write the http error to response writer	2015-03-16 15:24:19 -07:00
Xiang Li	bba7f75562	Merge pull request #2517 from yichengq/fix-sec2 security: fix var shadowing in CreateOrUpdateUser	2015-03-16 15:08:55 -07:00
Yicheng Qin	8335a5407b	security: fix var shadowing in CreateOrUpdateUser	2015-03-16 14:59:05 -07:00
Yicheng Qin	d7780cf293	security: fix var shadowing in CreateOrUpdate	2015-03-16 14:55:04 -07:00
Barak Michener	001efa0639	security: Implement RBAC security for etcd stub out security further wip Last stub before CRUD for roles Complete role merging start tests add Godep for golang.org/x/crypto/bcrypt first round of comments add tests, remove root addition (will be added back as part of creation) Add security checks for /v2/machines and /v2/keys Allow non-root to determine if security is enabled, get machine list. Responding to comments, remove multiple verbs (like /v2/security/user/foo/password) add some prefixes to the logging	2015-03-16 16:23:11 -04:00
Xiang Li	d015610da5	etcdserver: separate apply and raft routine	2015-03-10 13:34:24 -07:00
Yicheng Qin	b4b9b9118a	rafthttp: report MsgSnap status	2015-03-02 09:38:11 -08:00
Yicheng Qin	9989bf1d36	Merge pull request #2407 from yichengq/334 rafthttp: report unreachable status of the peer	2015-03-02 09:35:35 -08:00

... 2 3 4 5 6 ...

950 Commits (2e051c1c610496ccfc44389ff89eab49504d7176)