Hi, We've just encountered a very similar issue, in a DFZ setup. The router has a number of transit and peering peers. We lost the layer 2 connectivity for most of these peers, which resulted in bird6 crashing when it detected that the first peer was unreachable: Jan 20 04:53:07 estran bird6: lyonix_sfr: Error: Hold timer expired Jan 20 04:53:07 estran kernel: [208450.426908] bird6[5735]: segfault at 18 ip 00007ff06e5f6aa4 sp 00007fffd5dd0fe0 error 4 in bird6[7ff06e5eb000+74000] At the same time, bird correctly walked over all unreachable peers and didn't crash: Jan 20 04:53:14 estran bird: lyonix_sfr: Error: Hold timer expired Jan 20 04:53:25 estran bird: ipmax1: Error: Hold timer expired Jan 20 04:53:51 estran bird: lyonix_rs2: Error: Hold timer expired Jan 20 04:54:04 estran bird: ielo: Error: Hold timer expired This is on Debian wheezy, with the bird package from wheezy-backports (Version : 1.4.5-1~bpo70+1) BGP config is attached. There isn't anything in the logs, other than the two lines above. Any idea? We have three peerings in the same VLAN, but unlike Matthias, the neighbours addresses are different. On Mon, Jan 12, 2015 at 11:02:10PM +0100, Matthias Schiffer wrote:
Hi, recently bird6 has been crashing quite often (several times a week) on one of our systems (I don't know why only one of two systems with very similar config is affected). Our config contains ~50 BGP peerings, the number of exchanged routes is about 100 in total.
I was able to get a core of the crash, it can be found at
http://home.universe-factory.net/neoraider/bird6.core
I've put the corresponding binary (with debug information) at
http://home.universe-factory.net/neoraider/bird6
bird was built on Arch Linux with GCC 4.9.2.
Please let me know if you need any more information...
Thanks in advance, Matthias