mirrors/bird - bird

mirror of https://gitlab.nic.cz/labs/bird.git synced 2024-12-23 10:11:53 +00:00

Author	SHA1	Message	Date
Ondrej Zajicek (work)	bbc33f6ec3	Netlink: Add another workaround for older kernel headers Unfortunately, SOL_NETLINK is both recently added and arch-dependent, so we cannot just define it.	2022-01-15 22:39:40 +01:00
Ondrej Zajicek (work)	8988264a64	Netlink: Add workaround for older kernel headers	2022-01-14 23:15:05 +01:00
Ondrej Zajicek (work)	e818f16448	Netlink: Enable strict checking for KRT dumps Add strict checking for netlink KRT dumps to avoid PMTU cache records from FNHE table dump along with KRT. The Linux kernel added FNHE table dump to the netlink API in patch: `8d3b68cd37`.1561131177.git.sbrivio@redhat.com/ Therefore, since Linux 5.3 these route cache entries are dumped together with regular routes during periodic KRT scans, which in some cases may be a huge amount of useless data. This can be avoided by using strict checking for netlink dumps: https://lore.kernel.org/netdev/20181008031644.15989-1-dsahern@kernel.org/ The patch mitigates the risk of receiving an unknown and potentially large number of FNHE records that would block BIRD I/O in each sync. There is a known issue caused by GRE tunnels on Linux, which seem to create one FNHE record for each destination IP address that is routed through the tunnel, even when the PMTU equals the GRE interface MTU. Thanks to Tomas Hlavacek for the original patch.	2022-01-14 21:53:40 +01:00
Ondrej Zajicek (work)	d0dd1d20cd	Netlink: Explicitly skip received cloned routes Kernel uses cloned routes to keep route cache entries, but reports them together with regular routes. They were skipped implicitly as they do not have rtm_protocol filled. Add explicit check for cloned flag and skip such routes explicitly. Also, improve debug logs of skipped routes.	2022-01-14 19:07:57 +01:00
Ondrej Zajicek (work)	60e9def9ef	BGP: Add option 'free bind' The BGP 'free bind' option applies the IP_FREEBIND/IPV6_FREEBIND socket option for the BGP listening socket. Thanks to Alexander Zubkov for the idea.	2022-01-09 02:44:32 +01:00
Alexander Zubkov	87a02489f3	IO: Support nonlocal bind in socket interface Add option to socket interface for nonlocal binding, i.e. binding to an IP address that is not present on interfaces. This behaviour is enabled when SKF_FREEBIND socket flag is set. For Linux systems, it is implemented by IP_FREEBIND socket flag. Minor changes done by committer.	2022-01-08 19:02:31 +01:00
Ondrej Zajicek (work)	bcb25084d3	Test: Activate some remaining build tests	2022-01-05 20:07:27 +01:00
Ondrej Zajicek (work)	f5c8fb5fba	Netlink: Do not ignore dead routes from BIRD Currently, BIRD ignores dead routes to consider them absent. But it also ignores its own routes and thus it can not correctly manage such routes in some cases. This patch makes an exception for routes with proto bird when ignoring dead routes, so they can be properly updated or removed. Thanks to Alexander Zubkov for the original patch.	2022-01-05 19:25:42 +01:00
Ondrej Zajicek (work)	77d032c71f	Netlink: Improve multipath parsing errors Function nl_parse_multipath() should handle errors internally.	2022-01-05 18:46:41 +01:00
Ondrej Zajicek (work)	29dda184e5	Conf: Fix parsing full-length IPv6 addresses Lexer expression for bytestring was too loose, accepting also full-length IPv6 addresses. It should be restricted such that colon is used between every byte or never. Fix the regex and also add some test cases for it. Thanks to Alexander Zubkov for the bugreport	2022-01-05 16:38:49 +01:00
Matous	75aceadaf7	gitlab-ci.yml: failing gitlab runner fixed. 'registry.labs.nic.cz' -> 'registry.nic.cz' changed	2022-01-05 04:13:39 +01:00
Alexander Zubkov	77042292ff	Doc: Document min/max operators for lists	2021-12-28 04:09:36 +01:00
Alexander Zubkov	0e1fd7ea6a	Filter: Add operators to find minimum and maximum element of sets Add operators .min and .max to find minimum or maximum element in sets of types: clist, eclist, lclist. Example usage: bgp_community.min bgp_ext_community.max filter(bgp_large_community, [(as1, as2, *)]).min Signed-off-by: Alexander Zubkov <green@qrator.net>	2021-12-28 04:07:09 +01:00
Alexander Zubkov	e15e465720	Doc: Document community components access operators	2021-12-28 04:07:09 +01:00
Alexander Zubkov	a2a268da4f	Filter: Add operators to pick community components Add operators that can be used to pick components from pair (standard community) or lc (large community) types. For example: (10, 20).asn --> 10 (10, 20).data --> 20 (10, 20, 30).asn --> 10 (10, 20, 30).data1 --> 20 (10, 20, 30).data2 --> 30 Signed-off-by: Alexander Zubkov <green@qrator.net>	2021-12-28 04:07:00 +01:00
Ondrej Zajicek (work)	a39cd2cc0b	BSD: Assume onlink flag on ifaces with only host addresses The BSD kernel does not support the onlink flag and BIRD does not use direct routes for next hop validation, instead depends on interface address ranges. We would like to handle PtMP cases with only host addresses configured, like: ifconfig wg0 192.168.0.10/32 route add 192.168.0.4 -iface wg0 route add 192.168.0.8 -iface wg0 To accept BIRD routes with onlink next-hop, like: route 192.168.42.0/24 via 192.168.0.4%wg0 onlink BIRD would dismiss the route when receiving from the kernel, as the next-hop 192.168.0.4 is not part of any interface subnet and onlink flag is not kept by the BSD kernel. The commit fixes this by assuming that for routes received from the kernel, any next-hop is onlink on ifaces with only host addresses. Thanks to Stefan Haller for the original patch.	2021-12-27 21:00:04 +01:00
Job Snijders	b9f38727a7	RPKI: Add contextual out-of-bound checks in RTR Prefix PDU handler RFC 6810 and RFC 8210 specify that the "Max Length" value MUST NOT be less than the Prefix Length element (underflow). On the other side, overflow of the Max Length element is also possible: being an 8-bit unsigned integer, it allows for values larger than 32 or 128. This also implicitly ensures there is no overflow of "Length" value. When a PDU is received where the Max Length field is corrupted, the RTR client (BIRD) should immediately terminate the session, flush all data learned from that cache, and log an error for the operator. Minor changes done by committer.	2021-12-18 16:35:28 +01:00
Simon Ruderich	00410fd6c1	Doc: bgp: remove "advertise ipv4" The option was removed in `d15b0b0a` ("BGP redesign", 2016-12-07) but the documentation wasn't updated.	2021-12-18 03:17:48 +01:00
Ondrej Zajicek (work)	b21104c97e	Nest: Do not ignore secondary flag changes in ifa updates Compare all IA_* flags that are set by sysdep iface code. The old code ignores IA_SECONDARY flag when comparing whether iface address updates from kernel changed anything. This is usually not an issue as kernel removes all secondary addresses due to removal of the primary one, but it breaks when sysctl 'promote_secondaries' is enabled and kernel promotes secondary addresses to primary ones. Thanks to 'Alexander' for the bugreport.	2021-12-18 01:09:52 +01:00
Ondrej Zajicek (work)	78ddfd2600	Trie: Clarify handling of less-common net types For convenience, Trie functions generally accept as input values not only NET_IPx types of nets, but also NET_VPNx and NET_ROAx types. But returned values are always NET_IPx types.	2021-12-02 03:35:29 +01:00
Maria Matejka	f772afc525	Memory statistics split into Effective and Overhead This feature is intended mostly for checking that BIRD's allocation strategies don't consume much memory space. There are some cases where withdrawing routes in a specific order leads to memory fragmentation and this output should give the user at least a notion of how much memory is actually used for data storage and how much memory is "just allocated" or used for overhead. Also raising the "system allocator overhead estimation" from 8 to 16 bytes; it is probably even more. I've found 16 as a local minimum in best scenarios among reachable machines. I couldn't find any reasonable method to estimate this value when BIRD starts up. This commit also fixes the inaccurate computation of memory overhead for slabs where the "system allocator overhead estimation" was improperly added to the size of mmap-ed memory.	2021-11-27 22:54:15 +01:00
Ondrej Zajicek (work)	14fc24f3a5	Trie: Implement longest-prefix-match queries and walks The prefix trie now supports longest-prefix-match query by function trie_match_longest_ipX() and it can be extended to iteration over all covering prefixes for a given prefix (from longest to shortest) using TRIE_WALK_TO_ROOT_IPx() macro.	2021-11-26 03:26:36 +01:00
Maria Matejka	644e9ca94e	Directly mapped pages are kept for future use if temporarily not needed	2021-11-24 19:42:52 +00:00
Maria Matejka	df476c2e5d	Corking also feed start to keep BIRD running when refeeds would easily cause congestion	2021-11-22 19:05:44 +01:00
Maria Matejka	0fd1c1d091	Route attribute cache is now lockless on read / clone. Lots of time was spent locking when accessing route attribute cache. This overhead should be now reduced to a minimum.	2021-11-22 19:05:44 +01:00
Maria Matejka	adf37d8eff	VRF setting reduced to one argument, using default dummy iface for default vrf	2021-11-22 19:05:44 +01:00
Maria Matejka	dc160e11e1	Route table import-to-export announcement indirection to reduce pipe traffic	2021-11-22 19:05:44 +01:00
Maria Matejka	4f3fa1623f	Pipe runs in parallel.	2021-11-22 19:05:44 +01:00
Maria Matejka	878eeec12b	Routing tables now have their own loops. This basically means that: * there are some more levels of indirection and asynchronicity, mostly in cleanup procedures, requiring correct lock ordering * all the internal table operations (prune, next hop update) are done without blocking the other parts of BIRD * the protocols may get their own loops very soon	2021-11-22 19:05:44 +01:00
Maria Matejka	c7d0c5b252	Route subscription uses events	2021-11-22 19:05:44 +01:00
Maria Matejka	18f66055e3	Global table update pool removed	2021-11-22 19:05:44 +01:00
Maria Matejka	038fcf1c8b	Locking route attributes cache To access route attribute cache from multiple threads at once, we have to lock the cache on writing. The route attributes data structures are safe to read unless somebody tries to tamper with the cache itself.	2021-11-22 19:05:44 +01:00
Maria Matejka	f0507f05ce	Route sources have an explicit owner This commit prevents use-after-free of routes belonging to protocols which have been already destroyed, delaying also all the protocols' shutdown until all of their routes have been finally propagated through all the pipes down to the appropriate exports. The use-after-free was somewhat hypothetical yet theoretically possible in rare conditions, when one BGP protocol authors a lot of routes and the user deletes that protocol by reconfiguring at the same time as a next hop update is requested, causing rte_better() to be called on a not-yet-pruned network prefix while the owner protocol has been already freed. In parallel execution environments, this would become an inter-thread use-after-free, causing possible heisenbugs or other nasty problems.	2021-11-22 19:05:44 +01:00
Maria Matejka	2a224a9e1e	Route sources have their separate global lock	2021-11-22 19:05:44 +01:00
Maria Matejka	794a4eefa1	Keeping un-unmappable pages until they can be reused On Linux, munmap() may fail with ENOMEM when virtual memory is too fragmented. Working around this by just keeping such blocks for future use.	2021-11-22 19:05:44 +01:00
Maria Matejka	1b39473993	Introducing basic RCU primitives for lock-less shared data structures	2021-11-22 19:05:44 +01:00
Maria Matejka	3b20722a1f	Table cork: Stop creating updates when there are too many pending. The corked procedure gets a callback when uncorked. Supported by table maintenance routines and also BGP.	2021-11-22 19:05:43 +01:00
Maria Matejka	6e841b3153	Adding a generic cork mechanism for events	2021-11-22 19:05:43 +01:00
Maria Matejka	94eb0858c2	Converting the former BFD loop to a universal IO loop and protocol loop. There is a simple universal IO loop, taking care of events, timers and sockets. Primarily, one instance of a protocol should use exactly one IO loop to do all its work, as is now done in BFD. Contrary to previous versions, the loop is now launched and cleaned by the nest/proto.c code, allowing for a protocol to just request its own loop by setting the loop's lock order in config higher than the_bird. It is not supported nor checked if any protocol changed the requested lock order in reconfigure. No protocol should do it at all.	2021-11-22 19:05:43 +01:00
Maria Matejka	a4451535c6	Unified time for whole BIRD In previous versions, every thread used its own time structures, effectively leading to different time in every thread and strange logging messages. The time processing code now uses global atomic variables to keep current time available for fast concurrent reading and safe updates.	2021-11-22 19:05:43 +01:00
Maria Matejka	a845651bc5	Multithreaded BIRD needs reasonably new software to compile	2021-11-22 19:05:43 +01:00
Maria Matejka	c70b3198dc	Route export is now asynchronous. To allow for multithreaded execution, we need to break the import-export chain and buffer the exports before actually processing them.	2021-11-22 19:05:43 +01:00
Maria Matejka	f18968f52f	Better prophylaxis against recursive route loops In some specific configurations, it was possible to send BIRD into an infinite loop of recursive next hop resolution. This was caused by route priority inversion. To prevent priority inversions affecting other next hops, we simply refuse to resolve any next hop if the best route for the matching prefix is recursive or any other route with the same preference is recursive. Next hop resolution doesn't change route priority, therefore it is perfectly OK to resolve BGP next hops e.g. by an OSPF route, yet if the same (or covering) prefix is also announced by iBGP, by retraction of the OSPF route we would get a possible priority inversion.	2021-11-22 19:05:43 +01:00
Maria Matejka	44f26c49f9	Special table hooks rectified. * internal tables are now more standalone, having their own import and export hooks * route refresh/reload uses stale counter instead of stale flag, allowing to drop walking the table at the beginning * route modify (by BGP LLGR) is now done by a special refeed hook, reimporting the modified routes directly without filters	2021-11-22 19:05:43 +01:00
Maria Matejka	445eeaf3df	Split route table event into separate events The former rt_event is dropped in favour of separate table events. This allows for selective corking of NHU and prune.	2021-11-22 19:05:43 +01:00
Maria Matejka	c84ed60371	Moved BFD IO loop out of BFD as we want to use it as socket-io coroutine	2021-11-22 19:05:43 +01:00
Maria Matejka	a2af807357	Debug messages with timestamps. On most of current hardware, getting monotonic clock is fast enough to get it and write for each debug message.	2021-11-22 19:05:43 +01:00
Maria Matejka	8d706aedba	Fixing expensive list checks. Debug only commit.	2021-11-22 19:05:43 +01:00
Maria Matejka	df3264f51f	Lock position checking allows for safe lock unions	2021-11-22 19:05:43 +01:00
Maria Matejka	b5ca6a79d3	GDB: SKIP_BACK and linked list tools	2021-11-22 19:05:43 +01:00
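
The bounds check described in commit b9f38727a7 (RPKI: Max Length underflow and overflow in the RTR Prefix PDU) can be illustrated with a minimal sketch. This is not BIRD's code: BIRD is written in C, and the function name and signature below are hypothetical, chosen only to show the two conditions stated in the commit message.

```python
# Illustrative sketch only; prefix_pdu_lengths_valid is a hypothetical name,
# not a function from BIRD or from any RTR library.
def prefix_pdu_lengths_valid(prefix_len: int, max_len: int, ipv6: bool) -> bool:
    """Return True when the Prefix Length / Max Length pair is acceptable."""
    addr_bits = 128 if ipv6 else 32

    # Overflow: Max Length is an 8-bit field, so it can carry values
    # larger than the address width (32 for IPv4, 128 for IPv6).
    if max_len > addr_bits:
        return False

    # Underflow: Max Length must not be less than Prefix Length.
    if max_len < prefix_len:
        return False

    # Both checks together imply prefix_len <= addr_bits as well,
    # so the Prefix Length needs no separate overflow check.
    return True


if __name__ == "__main__":
    print(prefix_pdu_lengths_valid(24, 24, ipv6=False))
    print(prefix_pdu_lengths_valid(24, 16, ipv6=False))
    print(prefix_pdu_lengths_valid(48, 129, ipv6=True))
```

According to the commit message, a PDU failing such a check should lead the client to terminate the session, flush the data learned from that cache, and log an error; that handling is outside the scope of this sketch.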

4161 Commits