You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Grant Henke (Jira)" <ji...@apache.org> on 2020/06/02 17:37:00 UTC

[jira] [Resolved] (KUDU-1909) kudu tserver always starting pre-election

     [ https://issues.apache.org/jira/browse/KUDU-1909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Henke resolved KUDU-1909.
-------------------------------
    Fix Version/s: NA
       Resolution: Cannot Reproduce

> kudu tserver always starting pre-election
> -----------------------------------------
>
>                 Key: KUDU-1909
>                 URL: https://issues.apache.org/jira/browse/KUDU-1909
>             Project: Kudu
>          Issue Type: Bug
>          Components: consensus
>    Affects Versions: 1.2.0
>         Environment: three tserver nodes
> three masterServer nodes
>            Reporter: lizhong
>            Priority: Major
>             Fix For: NA
>
>
> I have some problems with the kudu tabletServer
> I installed three tserver nodes, one of which has stopped yesterday, and today I started it up and looked up the log and found that the node was starting the election. But from the web-ui can be observed, the table data is no problem. What can I do to get it to launch an election?
> Specific log information:
> Term 5 pre-election: Requesting pre-vote from peer ecaaa3e46f914e83a99b47f44e1e1045
> I0304 11: 11: 20.930647 46466 leader_election.cc:216] T 305900d1b718492dbb20c6dbd75a9c55 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Requesting pre-vote from peer 97625518ce894cedaf58917c776e1679
> I0304 11: 11: 20.930873 46266 leader_election.cc:362] T 305900d1b718492dbb20c6dbd75a9c55 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Vote denied by peer ecaaa3e46f914e83a99b47f44e1e1045 Message:. Invalid argument: T 305900d1b718492dbb20c6dbd75a9c55 P ecaaa3e46f914e83a99b47f44e1e1045 [term 4 FOLLOWER]: Leader pre -election vote request: Denying vote to candidate f384b386ce4e4c1db18eff0e8dfa41d6 for term 5 because replica is either leader or believes a valid leader to be alive
> I0304 11: 11: 20.930898 46264 leader_election.cc:362] T 305900d1b718492dbb20c6dbd75a9c55 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Vote denied by peer 97625518ce894cedaf58917c776e1679 Message:. Invalid argument: T 305900d1b718492dbb20c6dbd75a9c55 P 97625518ce894cedaf58917c776e1679 [term 4 LEADER]: Leader pre -election vote request: Denying vote to candidate f384b386ce4e4c1db18eff0e8dfa41d6 for term 5 because replica is either leader or believes a valid leader to be alive
> I0304 11: 11: 20.930920 46264 leader_election.cc:243] T 305900d1b718492dbb20c6dbd75a9c55 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Election call. Result: candidate lost.
> I0304 11: 11: 20.931056 47979 raft_consensus.cc:2162] T 305900d1b718492dbb20c6dbd75a9c55 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Snoozing failure detection for election timeout plus an additional 18.685s
> I0304 11: 11: 20.931077 47979 raft_consensus.cc: 1908] T 305900d1b718492dbb20c6dbd75a9c55 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Leader pre-election lost for term 5. Reason: could not put majority
> I0304 11: 11: 21.031502 46510 raft_consensus.cc:411] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Starting pre-election (no leaderunicail within the election timeout)
> I0304 11: 11: 21.031525 46510 raft_consensus.cc:2162] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Snoozing failure detection for election timeout plus an additional 11.912s
> I0304 11: 11: 21.031556 46510 raft_consensus.cc:435] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Starting pre-election with config: opid_index: -1 OBSOLETE_local: false peers {permanent_uuid: "ecaaa3e46f914e83a99b47f44e1e1045" member_type: VOTER last_known_addr {host : "Hadoopdata03vl" port: 7050}} peers {permanent_uuid: "" "" ":" hasoopdata02vl "port: 7050}" peers {permanent_uuid: "97625518ce894cedaf58917c776e1679" member_type: VOTER last_known_addr {host: "hadoopdata02vl" port: 7050}}
> I0304 11:11:21.031711 46510 leader_election.cc:216] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Requesting pre-vote from peer ecaaa3e46f914e83a99b47f44e1e1045
> I0304 11:11:21.031736 46510 leader_election.cc:216] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Requesting pre-vote from peer 97625518ce894cedaf58917c776e1679
> I0304 11:11:21.031921 46266 leader_election.cc:362] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Vote denied by peer ecaaa3e46f914e83a99b47f44e1e1045. Message: Invalid argument: T 0018e75858ce423191e95913c5fd3b56 P ecaaa3e46f914e83a99b47f44e1e1045 [term 4 LEADER]: Leader pre-election vote request: Denying vote to candidate f384b386ce4e4c1db18eff0e8dfa41d6 for term 5 because replica is either leader or believes a valid leader to be alive.
> I0304 11:11:21.031965 46264 leader_election.cc:362] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Vote denied by peer 97625518ce894cedaf58917c776e1679. Message: Invalid argument: T 0018e75858ce423191e95913c5fd3b56 P 97625518ce894cedaf58917c776e1679 [term 4 FOLLOWER]: Leader pre-election vote request: Denying vote to candidate f384b386ce4e4c1db18eff0e8dfa41d6 for term 5 because replica is either leader or believes a valid leader to be alive.
> I0304 11:11:21.031983 46264 leader_election.cc:243] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [CANDIDATE]: Term 5 pre-election: Election decided. Result: candidate lost.
> I0304 11:11:21.032114 47980 raft_consensus.cc:2162] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Snoozing failure detection for election timeout plus an additional 10.992s
> I0304 11:11:21.032133 47980 raft_consensus.cc:1908] T 0018e75858ce423191e95913c5fd3b56 P f384b386ce4e4c1db18eff0e8dfa41d6 [term 4 FOLLOWER]: Leader pre-election lost for term 5. Reason: could not achieve majority
> I0304 11:11:28.366984 46255 tablet.cc:902] T 9e1c3635babe46b392b0261b6c392ca7 Flush: entering stage 1 (old memrowset already frozen for inserts)
> I0304 11:11:28.367014 46255 compaction.cc:877] Selected 1 rowsets to compact:
> I0304 11:11:28.367019 46255 compaction.cc:880] memrowset(current size on disk: ~0 bytes)
> I0304 11:11:28.367022 46255 tablet.cc:904] T 9e1c3635babe46b392b0261b6c392ca7 Memstore in-memory size: 15703 bytes
> I0304 11:11:28.367028 46255 tablet.cc:1159] T 9e1c3635babe46b392b0261b6c392ca7 Flush: entering phase 1 (flushing snapshot). Phase 1 snapshot: MvccSnapshot[committed={T|T < 6097293645617463296 or (T in {6097293645617463296})}]
> I0304 11:11:28.367106 46255 multi_column_writer.cc:85] Opened CFile writer for column system[string NOT NULL]
> I0304 11:11:28.367138 46255 multi_column_writer.cc:85] Opened CFile writer for column event[string NOT NULL]
> I0304 11:11:28.367162 46255 multi_column_writer.cc:85] Opened CFile writer for column time[int64 NOT NULL]
> I0304 11:11:28.367185 46255 multi_column_writer.cc:85] Opened CFile writer for column user_ip[string NOT NULL]
> I0304 11:11:28.367205 46255 multi_column_writer.cc:85] Opened CFile writer for column uid[string NULLABLE]
> I0304 11:11:28.367224 46255 multi_column_writer.cc:85] Opened CFile writer for column url[string NULLABLE]
> I0304 11:11:28.367244 46255 multi_column_writer.cc:85] Opened CFile writer for column method[string NULLABLE]
> I0304 11:11:28.376103 46255 tablet.cc:1244] T 9e1c3635babe46b392b0261b6c392ca7 Flush: entering phase 2 (starting to duplicate updates in new rowsets)
> I0304 11:11:28.384232 46255 tablet.cc:1299] T 9e1c3635babe46b392b0261b6c392ca7 Flush Phase 2: carrying over any updates which arrived during Phase 1
> I0304 11:11:28.384249 46255 tablet.cc:1301] T 9e1c3635babe46b392b0261b6c392ca7 Phase 2 snapshot: MvccSnapshot[committed={T|T < 6097293645617463296 or (T in {6097293645617463296})}]
> I0304 11:11:28.399137 46255 tablet.cc:1343] T 9e1c3635babe46b392b0261b6c392ca7 Flush successful on 28 rows (6782 bytes)
> I0304 11:11:28.399181 46255 maintenance_manager.cc:372] Time spent running FlushMRSOp(9e1c3635babe46b392b0261b6c392ca7): real 0.041s    user 0.032s sys 0.001s
> I0304 11:11:28.399209 46255 maintenance_manager.cc:378] FlushMRSOp(9e1c3635babe46b392b0261b6c392ca7) metrics: {"cfile_init":1,"data dir 0.queue_time_us":119,"data dir 0.run_cpu_time_us":143,"data dir 0.run_wall_time_us":7842,"fdatasync":21,"fdatasync_us":8624,"lbm_read_time_us":7,"lbm_reads_lt_1ms":4,"lbm_write_time_us":220,"lbm_writes_lt_1ms":48,"thread_start_us":104,"threads_started":1}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)