Ondrej Zajicek пишет:
On Thu, Jan 28, 2010 at 04:40:54PM +0300, Mikhail A. Grishin wrote:
Hi,
We found that "Finite state machine error" problem is not related to your patches. It randomly occurs on our production server at the time of daemon startup :((
The problem is occurs on small number of peers. (2 or 3 or 4 from ~280) Some problem peers are the same at next startup, some - not.
On test server with small number of active peers (and same config) we doesn't see this issue.
What can be done? Right now we see the problem on pure 1.2.0 release...
This might be a buggy version of firmware in the neighbor, as well as some strange bug in BIRD.
About "UPDATE message immediately after it sent OPEN" - we ask one of our customers (which hit that problem) to collect debug from his side. See the attachments (3 files).
I can't find the KEEPALIVE message in the log, but i don't know Cisco enough to be sure (perhaps it just does not log it).
The best thing would be to run on route server:
tcpdump -i eth0 -s 0 -v -n ip host 192.168.1.1 > logfile
See attach (dump with another peer, 193.232.246.198, R34485x1) As far as I see, the first one from 193.232.246.198 (18:15:06.785501) is Update, the second one (18:15:06.788230) is Keepalive But why at another daemon run the session is up with the same peer? Direction of session establishing make sense?
(with appropriate network device and IP address of one of problematic neighbors)
and send me that logfile.
-- Mikhail A. Grishin E-mail: magr@ripn.net Phone: +7 (495) 737-0685 MSK-IX & Russian Institute for Public Networks Phone: +7 (499) 192-9179 Network Operations Center 18:15:06.782511 IP (tos 0xc0, ttl 64, id 9139, offset 0, flags [DF], proto TCP (6), length 44) 193.232.246.100.179 > 193.232.246.198.29675: S, cksum 0x711b (incorrect (-> 0xf6c2), 1674454884:1674454884(0) ack 791544152 win 65535 <mss 1460> 18:15:06.784043 IP (tos 0xc0, ttl 1, id 48499, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0xce7f (correct), ack 1 win 16384 18:15:06.784227 IP (tos 0xc0, ttl 1, id 9140, offset 0, flags [DF], proto TCP (6), length 85) 193.232.246.100.179 > 193.232.246.198.29675: P, cksum 0x7144 (incorrect (-> 0x52f9), 1:46(45) ack 1 win 65535: BGP, length: 45 Open Message (1), length: 45 Version 4, my AS 8631, Holdtime 180s, ID 193.232.246.100 Optional parameters, length: 16 Option Capabilities Advertisement (2), length: 14 Multiprotocol Extensions (1), length: 4 AFI IPv4 (1), SAFI Unicast (1) 18:15:06.785012 IP (tos 0xc0, ttl 1, id 48500, offset 0, flags [DF], proto TCP (6), length 90) 193.232.246.198.29675 > 193.232.246.100.179: P, cksum 0x8384 (correct), 1:51(50) ack 1 win 16384: BGP, length: 50 Open Message (1), length: 50 Version 4, my AS 34485, Holdtime 180s, ID 89.16.63.3 Optional parameters, length: 21 Option Capabilities Advertisement (2), length: 6 Multiprotocol Extensions (1), length: 4 AFI IPv4 (1), SAFI Unicast (1) Option Capabilities Advertisement (2), length: 2 Route Refresh (Cisco) (128), length: 0 Option Capabilities Advertisement (2), length: 2 Route Refresh (2), length: 0 Option Capabilities Advertisement (2), length: 3 Unknown (131), length: 1 no decoder for Capability 131 0x0000: 00 18:15:06.785501 IP (tos 0xc0, ttl 1, id 48501, offset 0, flags [DF], proto TCP (6), length 1450) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0xe348 (correct), 51:1461(1410) ack 1 win 16384: BGP, length: 1410 [|BGP Update] 18:15:06.785516 IP (tos 0xc0, ttl 1, id 9144, offset 0, flags [DF], proto TCP (6), length 59) 193.232.246.100.179 > 193.232.246.198.29675: P, cksum 0x712a (incorrect (-> 0x094e), 46:65(19) ack 1461 win 64290: BGP, length: 19 Keepalive Message (4), length: 19 18:15:06.787400 IP (tos 0xc0, ttl 1, id 48502, offset 0, flags [DF], proto TCP (6), length 1500) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0xb370 (correct), 1461:2921(1460) ack 65 win 16320: BGP, length: 1460 18:15:06.788230 IP (tos 0xc0, ttl 1, id 48503, offset 0, flags [DF], proto TCP (6), length 1280) 193.232.246.198.29675 > 193.232.246.100.179: P, cksum 0x4f34 (correct), 2921:4161(1240) ack 65 win 16320: BGP, length: 1240 [|BGP] Keepalive Message (4), length: 19 18:15:06.788243 IP (tos 0xc0, ttl 1, id 9145, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.100.179 > 193.232.246.198.29675: ., cksum 0x7117 (incorrect (-> 0x0233), ack 4161 win 64460 18:15:06.788346 IP (tos 0xc0, ttl 1, id 9146, offset 0, flags [DF], proto TCP (6), length 61) 193.232.246.100.179 > 193.232.246.198.29675: P, cksum 0x712c (incorrect (-> 0xfac8), 65:86(21) ack 4161 win 65535: BGP, length: 21 Notification Message (3), length: 21, Finite State Machine Error (5) 18:15:06.788359 IP (tos 0xc0, ttl 1, id 9147, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.100.179 > 193.232.246.198.29675: F, cksum 0x7117 (incorrect (-> 0xfde9), 86:86(0) ack 4161 win 65535 18:15:06.789957 IP (tos 0xc0, ttl 1, id 48504, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0xbe3e (correct), ack 87 win 16299 18:15:06.790548 IP (tos 0xc0, ttl 1, id 48505, offset 0, flags [DF], proto TCP (6), length 1500) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0x88e9 (correct), 4161:5621(1460) ack 87 win 16299: BGP, length: 1460 [|BGP Update] 18:15:06.790561 IP (tos 0xc0, ttl 64, id 9154, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.100.179 > 193.232.246.198.29675: R, cksum 0x7117 (incorrect (-> 0x3ebc), 1674454971:1674454971(0) win 0 18:15:06.790772 IP (tos 0xc0, ttl 1, id 48506, offset 0, flags [DF], proto TCP (6), length 1500) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0x612f (correct), 5621:7081(1460) ack 87 win 16299: BGP, length: 1460 18:15:06.790782 IP (tos 0xc0, ttl 64, id 9155, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.100.179 > 193.232.246.198.29675: R, cksum 0x7117 (incorrect (-> 0x3ebc), 1674454971:1674454971(0) win 0 18:15:06.791011 IP (tos 0xc0, ttl 1, id 48507, offset 0, flags [DF], proto TCP (6), length 1500) 193.232.246.198.29675 > 193.232.246.100.179: ., cksum 0xec9b (correct), 7081:8541(1460) ack 87 win 16299: BGP, length: 1460 [|BGP] [|BGP Update] 18:15:06.791020 IP (tos 0xc0, ttl 64, id 9156, offset 0, flags [DF], proto TCP (6), length 40) 193.232.246.100.179 > 193.232.246.198.29675: R, cksum 0x7117 (incorrect (-> 0x3ebc), 1674454971:1674454971(0) win 0