mirrors/bird - bird

mirror of https://gitlab.nic.cz/labs/bird.git synced 2024-12-23 10:11:53 +00:00

Author	SHA1	Message	Date
Ondrej Zajicek	6a242b3ec6	IO: Fix race condition in event processing When regular event was added from work event, we did remember that regular event list was empty and therefore we did not use zero time in poll(). This leads to ~3 s latency in route reload during reconfiguration.	2023-10-04 17:36:03 +02:00
Ondrej Zajicek	333ddd4f98	MPLS subsystem The MPLS subsystem manages MPLS labels and handles their allocation to MPLS-aware routing protocols. These labels are then attached to IP or VPN routes representing label switched paths -- LSPs. There was already a preliminary MPLS support consisting of MPLS label net_addr, MPLS routing tables with static MPLS routes, remote labels in next hops, and kernel protocol support. This patch adds the MPLS domain as a basic structure representing local label space with dynamic label allocator and configurable label ranges. To represent LSPs, allocated local labels can be attached as route attributes to IP or VPN routes with local labels as attributes. There are several steps for handling LSP routes in routing protocols -- deciding to which forwarding equivalence class (FEC) the LSP route belongs, allocating labels for new FECs, announcing MPLS routes for new FECs, attaching labels to LSP routes. The FEC map structure implements basic code for managing FECs in routing protocols, therefore existing protocols can be made MPLS-aware by adding FEC map and delegating most work related to local label management to it.	2023-10-04 13:01:21 +02:00
Maria Matejka	51f2e7afaf	Conf: Symbol manipulation gets its context explicitly	2023-09-12 15:36:46 +02:00
Maria Matejka	8659818391	Conf: Adding dummy thread-number setting for easier sharing of configuration between v2 and v3	2023-09-12 14:53:55 +02:00
Toke Høiland-Jørgensen	d8cf3cad51	IO: Add current_time_now() function for immediate timestamp Add a current_time_now() function which gets an immediate monotonic timestamp instead of using the cached value from the event loop. This is useful for callers that need precise times, such as the Babel RTT measurement code. Minor changes by committer.	2023-06-02 00:26:41 +02:00
Ondrej Zajicek	6b38285f58	Net: Replace runtime checks with STATIC_ASSERT()	2023-03-06 11:57:40 +01:00
Ondrej Zajicek	804916daa9	Alloc: Minor cleanups - Fix THP disable on old systems - Failed syscalls should use die() instead of bug() - Our printf uses %ld for s64 instead of long	2023-01-18 13:40:21 +01:00
Maria Matejka	6bb992cb04	Merge branch 'master' of https://gitlab.nic.cz/labs/bird	2023-01-18 12:33:06 +01:00
Maria Matejka	973aa37e1e	Fix memory pre-allocation When BIRD has no free memory mapped, it allocates several pages in advance just to be sure that there is some memory available if needed. This hysteresis tactics works quite well to reduce memory ping-ping with kernel. Yet it had a subtle bug: this pre-allocation didn't take a memory coldlist into account, therefore requesting new pages from kernel even in cases when there were other pages available. This led to slow memory bloating. To demonstrate this behavior fast enough to be seen well, you may: * temporarily set the values in sysdep/unix/alloc.c as follows to exacerbate the issue: #define KEEP_PAGES_MAIN_MAX 4096 #define KEEP_PAGES_MAIN_MIN 1000 #define CLEANUP_PAGES_BULK 4096 * create a config file with several millions of static routes * periodically disable all static protocols and then reload config * log memory consumption This should give you a steady growth rate of about 16kB per cycle. If you don't set the values this high, the issue happens much more slowly, yet after 14 days of running, you are going to see an OOM kill. After this fix, pre-allocation uses the memory coldlist to get some hot pages and the same test as described here gets you a perfectly stable constant memory consumption (after some initial wobbling). Thanks to NIX-CZ for reporting and helping to investigate this issue. Thanks to Santiago for finding the cause in the code.	2023-01-18 09:39:45 +01:00
Ondrej Zajicek	928a1cb034	Alloc: Disable transparent huge pages The usage pattern implemented in allocator seems to be incompatible with transparent huge pages, as memory released using madvise(MADV_DONTNEED) with regular page size and alignment does not seem to trigger demotion of huge pages back to regular pages, even when significant number of pages is released. Even if demotion is triggered when system memory is low, it still breaks memory accounting.	2023-01-17 17:13:50 +01:00
Mike Crute	64a2b7aaa3	Log message before aborting Log message before aborting due to watchdog timeout. We have to use async-safe write to debug log, as it is done in signal handler. Minor changes from committer.	2023-01-12 17:40:53 +01:00
Ondrej Zajicek	4c19a8a984	CLI: Fix for long-lived sessions during high loads When there is a continuos stream of CLI commands, cli_get_command() always returns 1 (there is a new command). Anyway, the socket receive buffer was reset only when there was no command at all, leading to a strange behavior: after a while, the CLI receive buffer came to its end, then read() was called with zero size buffer, it returned 0 which was interpreted as EOF. The patch fixes that by resetting the buffer position after each command and moving remaining data at the beginning of buffer. Thanks to Maria Matejka for examining the bug and for the original bugfix.	2022-12-10 17:32:42 +01:00
Ondrej Zajicek	543c8ba097	BSD: Fix krt socket code w.r.t. rte/rta changes	2022-11-30 02:43:39 +01:00
Ondrej Zajicek	bbac9ca958	Conf: Make 'configure check' command restricted While it does not directly change BIRD state, it can trigger reading arbitrary files and eating significant memory.	2022-11-09 22:02:46 +01:00
Ondrej Zajicek	371eb49043	Conf: Free stored old config before parsing new one BIRD keeps a previous (old) configuration for the purpose of undo. The existing code frees it after a new configuration is successfully parsed during reconfiguration. That causes memory usage spikes as there are temporarily three configurations (old, current, and new). The patch changes it to free the old one before parsing the new one (as user already requested a new config). The disadvantage is that undo is not available after failed reconfiguration.	2022-11-09 21:54:45 +01:00
Maria Matejka	57308fb277	Page allocator: Fixed minor bugs and added commentary	2022-11-03 12:38:57 +01:00
Maria Matejka	9d03c3f56c	Memory pages are not munmapped, instead we just madvise() Memory unmapping causes slow address space fragmentation, leading in extreme cases to failing to allocate pages at all. Removing this problem by keeping all the pages allocated to us, yet calling madvise() to let kernel dispose of them. This adds a little complexity and overhead as we have to keep the pointers to the free pages, therefore to hold e.g. 1 GB of 4K pages with 8B pointers, we have to store 2 MB of data.	2022-11-02 12:56:54 +01:00
Alexander Zubkov	0f2be469f8	KRT: Fix setting default preference Changes in commit `eb937358` broke setting of channel preference for alien routes learned during scan. The preference was set only for async routes. Move common attribute processing part of functions krt_learn_async() and krt_learn_async() to a separate function to have only one place for such changes.	2022-09-27 11:33:41 +02:00
Maria Matejka	dc28c6ed1c	Simplified the protocol hookup code in Makefiles	2022-08-18 22:07:30 +02:00
Ondrej Zajicek	082905a833	Merge branch 'master' into backport	2022-07-27 00:47:24 +02:00
Ondrej Zajicek	534d0a4b44	KRT: Scan routing tables separetely on linux to avoid congestion Remove compile-time sysdep option CONFIG_ALL_TABLES_AT_ONCE, replace it with runtime ability to run either separate table scans or shared scan. On Linux, use separate table scans by default when the netlink socket option NETLINK_GET_STRICT_CHK is available, but retreat to shared scan when it fails. Running separate table scans has advantages where some routing tables are managed independently, e.g. when multiple routing daemons are running on the same machine, as kernel routing table modification performance is significantly reduced when the table is modified while it is being scanned. Thanks Daniel Gröber for the original patch and Toke Høiland-Jørgensen for suggestions.	2022-07-24 02:15:20 +02:00
Maria Matejka	2e5bfeb73a	Merge remote-tracking branch 'origin/master' into backport	2022-07-11 11:08:10 +02:00
Maria Matejka	d429bc5c84	Merge commit 'beb5f78a' into backport	2022-07-11 10:41:17 +02:00
Maria Matejka	7e9cede1fd	Merge version 2.0.10 into backport	2022-07-10 14:19:24 +02:00
Ondrej Zajicek (work)	946cedfcfe	Filter: Implement soft scopes Soft scopes are anonymous scopes that most likely do not contain any symbol, so allocating regular scope is postponed when it is really needed.	2022-06-27 21:13:31 +02:00
Maria Matejka	beb5f78ada	Preexport callback now takes the channel instead of protocol as argument Passing protocol to preexport was in fact a historical relic from the old times when channels weren't a thing. Refactoring that to match current extensibility needs.	2022-06-27 19:04:24 +02:00
Ondrej Zajicek	f39e9aa203	IO: Improve resolution of latency debugging messages	2022-06-04 17:54:08 +02:00
Maria Matejka	9eec503b25	Fixed a munmap abort bug When BIRD was munmapping too many pages, it sometimes aborted, saying that munmap failed with "Not enough memory" as the address space was getting more and more fragmented. There is a workaround in place, simply keeping that page for future use, yet it has never been compiled in because I somehow forgot to include errno.h. And because I also thought that somebody may have ENOMEM not defined (why?!), there was a check which quietly omitted that workaround. Anyway, ENOMEM is POSIX. It's an utter nonsense to check for its existence. If it doesn't exist, something is broken.	2022-04-13 11:36:54 +02:00
Maria Matejka	4a23ede2b0	Protocols have their own explicit init routines	2022-04-06 18:14:08 +02:00
Maria Matejka	4e60b3ee72	Fixed a static assert in page allocator	2022-03-09 13:28:03 +01:00
Maria Matejka	19e727a248	Merge commit '60880b539b8886f76961125d89a265c6e1112b7a' into haugesund	2022-03-09 11:29:56 +01:00
Maria Matejka	24773af9e0	Merge commit 'e42eedb9' into haugesund	2022-03-09 11:02:55 +01:00
Maria Matejka	83d9920f90	Merge commit '5cff1d5f' into haugesund Conflicts: proto/bgp/attrs.c proto/pipe/pipe.c	2022-03-09 10:56:06 +01:00
Maria Matejka	ff47cd80dd	Merge commit 'd5a32563' into haugesund	2022-03-09 10:50:38 +01:00
Maria Matejka	9e60a1fbc3	Fixed resource initialization in unit tests	2022-03-09 10:30:42 +01:00
Maria Matejka	eeec9ddbf2	Merge commit '0c59f7ff' into haugesund	2022-03-09 09:13:55 +01:00
Maria Matejka	0c59f7ff01	Revert "Bound allocated pages to resource pools with page caches to avoid unnecessary syscalls" This reverts commit `7f0e598208`.	2022-03-09 09:13:31 +01:00
Maria Matejka	1c7df2c240	Revert "Multipage allocation" This reverts commit `6cd3771378`.	2022-03-09 09:13:20 +01:00
Maria Matejka	c78247f9b9	Single-threaded version of sark-branch memory page management	2022-03-09 09:10:44 +01:00
Maria Matejka	48bf1322aa	Introducing an universal temporary linpool flushed after every task	2022-03-02 12:13:49 +01:00
Maria Matejka	d071aca7aa	Merge commit '2c13759136951ef0e70a3e3c2b2d3c9a387f7ed9' into haugesund	2022-03-02 10:01:44 +01:00
Ondrej Zajicek (work)	2fc8b4c4ba	Alloc: Use posix_memalign() instead of aligned_alloc() For compatibility with older systems use posix_memalign(). We can switch to aligned_alloc() when we commit to C11 for multithreading.	2022-02-08 22:42:00 +01:00
Alexander Zubkov	87a02489f3	IO: Support nonlocal bind in socket interface Add option to socket interface for nonlocal binding, i.e. binding to an IP address that is not present on interfaces. This behaviour is enabled when SKF_FREEBIND socket flag is set. For Linux systems, it is implemented by IP_FREEBIND socket flag. Minor changes done by commiter.	2022-01-08 19:02:31 +01:00
Maria Matejka	644e9ca94e	Directly mapped pages are kept for future use if temporarily not needed	2021-11-24 19:42:52 +00:00
Maria Matejka	e42eedb912	Kernel: Convert the rte-local attributes to extended attributes and flags to pflags	2021-10-13 19:09:04 +02:00
Maria Matejka	5cff1d5f02	Route: moved rte_src pointer from rta to rte It is an auxiliary key in the routing table, not a route attribute.	2021-10-13 19:09:04 +02:00
Maria Matejka	d5a32563df	Preexport: No route modification, no linpool needed	2021-10-13 19:09:04 +02:00
Maria Matejka	541881bedf	RIP fixup + dropping the tmp_attrs mechanism as obsolete	2021-10-13 19:09:04 +02:00
Maria Matejka	eb937358c0	Preference moved to RTA and set explicitly in protocols	2021-10-13 19:09:04 +02:00
Maria Matejka	6cd3771378	Multipage allocation We can also quite simply allocate bigger blocks. Anyway, we need these blocks to be aligned to their size which needs one mmap() two times bigger and then two munmap()s returning the unaligned parts. The user can specify -B <N> on startup when <N> is the exponent of 2, setting the block size to 2^N. On most systems, N is 12, anyway if you know that your configuration is going to eat gigabytes of RAM, you are almost forced to raise your block size as you may easily get into memory fragmentation issues or you have to raise your maximum mapping count, e.g. "sysctl vm.max_map_count=(number)".	2021-10-13 19:01:22 +02:00

1 2 3 4 5 ...

570 Commits