Andrew Elwell
11/07/2025, 5:10 AM
- name: content_modifier
  action: extract
  key: 'message'
  # print one line per reply, with time, IP, name, type, class, rcode, timetoresolve, fromcache and responsesize.
  pattern: '^(?<source_ip>[^ ]+) (?<query_request>[^ ]+) (?<query_record_type>[^ ]+) (?<query_class>[^ ]+) (?<query_result>[^ ]+) (?<unbound_time_to_resolve>[^ ]+) (?<unbound_from_cache>[^ ]+) (?<query_response_length>.+)$'
  condition:
    op: and
    rules:
      - field: '$message_type'
        op: eq
        value: 'reply'
- name: content_modifier
  action: convert
  key: unbound_time_to_resolve
  converted_type: int
- name: content_modifier
  action: convert
  key: query_response_length
  converted_type: int
- name: content_modifier
  action: convert
  key: unbound_from_cache
  converted_type: boolean
but I'm still getting:
[3] log_unbound: [[1762491366.000000000, {}], {"process"=>"unbound", "pid"=>934, "tid"=>"1", "message_type"=>"reply", "message"=>"127.0.0.1 vmetrics1.pawsey.org.au. AAAA IN NOERROR 0.000000 1 109", "gim_event_type_code"=>"140200", "source_ip"=>"127.0.0.1", "query_request"=>"vmetrics1.pawsey.org.au.", "query_record_type"=>"AAAA", "query_class"=>"IN", "query_result"=>"NOERROR", "unbound_time_to_resolve"=>"0.000000", "unbound_from_cache"=>"1", "query_response_length"=>"109", "cluster"=>"DNS", "event_reporter"=>"ns0"}]
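For reference, a minimal sketch of how these processors would typically be attached to an input in fluent-bit.yaml (the tail input and its path are illustrative, not from the thread). Note also that "0.000000" is a fractional string, so converted_type: double may be a better fit than int for unbound_time_to_resolve:

pipeline:
  inputs:
    - name: tail                  # illustrative input; any log input works
      path: /var/log/unbound.log  # hypothetical path
      tag: unbound
      processors:
        logs:
          - name: content_modifier
            action: extract
            key: 'message'
            pattern: '<regex as above>'
          - name: content_modifier
            action: convert
            key: unbound_time_to_resolve
            converted_type: double  # assumption: int may reject a fractional string like "0.000000"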
gcol
11/07/2025, 7:49 AM
Alex
11/07/2025, 8:52 AM
[error] [output:opentelemetry:opentelemetry.3] could not flush records (http_do=-1)
With Log_Level debug (pos=5 or pos=0):
[debug] [yyjson->msgpack] read error code=6 msg=unexpected character, expected a JSON value pos=0
For some reason I don't see the error from the info log_level in the debug output. (JFYI)
Questions:
1. Is forwarding OTLP traces from OpenTelemetry input to OpenTelemetry output supposed to work in Fluent Bit?
2. Should I use the Raw_Traces On parameter? (tried both with and without: same errors)
[INPUT]
    Name          opentelemetry
    Listen        0.0.0.0
    Port          4318
    Tag           otel
    Tag_From_Uri  Off

[OUTPUT]
    Name          opentelemetry
    Match         otel
    Host          ${OSIS_PIPELINE_HOST}
    Port          443
    Traces_uri    /v1/traces
    Tls           On
    Tls.verify    On
    aws_auth      On
    aws_service   osis
    aws_region    ${AWS_REGION}
Thanks in advance! 🙏
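One way to narrow this down (a sketch, assuming the listener is reachable locally on 4318): post a minimal OTLP/HTTP JSON payload to the input and watch whether the output flush still fails:

curl -sS -X POST http://localhost:4318/v1/traces \
  -H "Content-Type: application/json" \
  -d '{"resourceSpans":[]}'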
Dean Meehan
11/07/2025, 2:14 PM
I'm trying to map fields.service_name to the OTEL Resource service.name.
Fluentbit Tail: {"event": "my log message", "fields": {"service_name": "my_service", "datacenter": "eu-west"}}
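A rough sketch of one approach, assuming a recent Fluent Bit where the opentelemetry_envelope and content_modifier processors are available (the tail path is hypothetical; content_modifier values are static strings, so copying service_name dynamically out of the record would likely need a Lua filter instead):

pipeline:
  inputs:
    - name: tail
      path: /var/log/app.log  # hypothetical path
      tag: app
      processors:
        logs:
          - name: opentelemetry_envelope  # wraps records in OTLP resource/scope groups
          - name: content_modifier
            context: otel_resource_attributes
            action: upsert
            key: service.name
            value: my_service  # static value; mapping it from fields.service_name is the open question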
Gmo1492
11/07/2025, 4:38 PM
Gmo1492
11/07/2025, 4:38 PM
Gmo1492
11/07/2025, 4:39 PM
Gmo1492
11/07/2025, 4:39 PM
Reading state information... Done
E: Unable to locate package fluent-bit
[fluent-bit][error] APT install failed (vendor repo unreachable and Ubuntu archive install failed).
Gmo1492
11/07/2025, 4:39 PM
Gmo1492
11/07/2025, 4:42 PM
apt-key is deprecated. Manage keyring files in trusted.gpg.d instead (see apt-key(8)). (My setup was outdated.) Once I changed it to follow the latest install docs, it fails.
Scott Bisker
11/07/2025, 4:46 PM
Gmo1492
11/07/2025, 4:48 PM
Jason A
11/07/2025, 5:38 PM
amazon-ebs: E: Failed to fetch https://packages.fluentbit.io/ubuntu/noble/dists/noble/InRelease 522
amazon-ebs: E: The repository 'https://packages.fluentbit.io/ubuntu/noble noble InRelease' is no longer signed.
amazon-ebs: N: Updating from such a repository can't be done securely, and is therefore disabled by default.
amazon-ebs: N: See apt-secure(8) manpage for repository creation and user configuration details.
==> amazon-ebs: Provisioning step had errors: Running the cleanup provis
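For what it's worth, the keyring-based repository setup from the current install docs looks roughly like this (noble hardcoded here; the 522 above is a server-side/CDN error, so this only helps once the repo itself is reachable again):

curl -fsSL https://packages.fluentbit.io/fluentbit.key | \
    gpg --dearmor -o /usr/share/keyrings/fluentbit-keyring.gpg
echo "deb [signed-by=/usr/share/keyrings/fluentbit-keyring.gpg] https://packages.fluentbit.io/ubuntu/noble noble main" | \
    tee /etc/apt/sources.list.d/fluent-bit.list
apt-get update && apt-get install -y fluent-bit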
Scott Bisker
11/07/2025, 6:19 PM
Jason A
11/07/2025, 6:21 PM
Josh
11/07/2025, 6:52 PM
Gmo1492
11/07/2025, 7:06 PM
Celalettin
11/07/2025, 7:23 PM
Saksham
11/10/2025, 8:23 AM
Bryson Edwards
11/10/2025, 10:49 PM
{
  "namespace": "test"
}
I would want:
{
  "some_new_field": "some_new_value",
  "spec": {
    "namespace": "test"
  }
}
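A minimal sketch of one way to get there with stock filters, assuming the YAML config format (the match pattern is illustrative): modify adds the new top-level field and nest moves the existing key under spec:

pipeline:
  filters:
    - name: modify
      match: '*'
      add: some_new_field some_new_value
    - name: nest
      match: '*'
      operation: nest
      wildcard: namespace   # keys to move
      nest_under: spec      # target map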
Michael Marshall
11/11/2025, 9:51 PM
Post "http://192.168.141.95:9880/services/collector": context deadline exceeded (Client.Timeout exceeded while awaiting headers)
I have tried lots of options and configurations, but my current one is CLI-based:
root@ip-192-168-141-95:~# /opt/fluent-bit/bin/fluent-bit -i splunk -p port=9880 -p buffer_chunk_size=1024 -p buffer_max_size=32M -p tag=splunk.logs -p net.io_timeout=300s -o stdout -p match=splunk.logs -vv
which is producing:
[2025/11/11 21:44:09.381347930] [trace] [io] connection OK
[2025/11/11 21:44:09.381397730] [trace] [sched] 0 timer coroutines destroyed
[2025/11/11 21:44:09.381863699] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:09.381894442] [trace] [io coro=(nil)] [net_read] ret=1024
... (the same try/ret=1024 pair repeats roughly 20 more times) ...
[2025/11/11 21:44:09.383193275] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:09.383203629] [trace] [io coro=(nil)] [net_read] ret=706
[2025/11/11 21:44:09.383216611] [trace] [sched] 0 timer coroutines destroyed
[2025/11/11 21:44:09.879452869] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:09.879531281] [trace] [io coro=(nil)] [net_read] ret=0
[2025/11/11 21:44:09.879549725] [trace] [downstream] destroy connection #48 to tcp://192.168.141.95:46304
[2025/11/11 21:44:10.95119333] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:10.95162342] [trace] [io coro=(nil)] [net_read] ret=0
[2025/11/11 21:44:10.95179536] [trace] [downstream] destroy connection #49 to tcp://192.168.141.95:46314
... (periodic "[sched] 0 timer coroutines destroyed" lines omitted) ...
[2025/11/11 21:44:12.173087049] [trace] [io] connection OK
[2025/11/11 21:44:12.173810559] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:12.173841862] [trace] [io coro=(nil)] [net_read] ret=1024
... (the same try/ret=1024 pair repeats roughly 10 more times) ...
[2025/11/11 21:44:12.174441221] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:12.174447781] [trace] [io coro=(nil)] [net_read] ret=314
[2025/11/11 21:44:12.430735078] [trace] [io coro=(nil)] [net_read] try up to 1024 bytes
[2025/11/11 21:44:12.430779926] [trace] [io coro=(nil)] [net_read] ret=0
[2025/11/11 21:44:12.430796710] [trace] [downstream] destroy connection #52 to tcp://192.168.141.95:46322
Any ideas?
When I switched it to TCP, I get:
[2025/11/11 21:49:12.454217350] [ info] [fluent bit] version=4.1.1, commit=, pid=7654
[2025/11/11 21:49:12.454345650] [ info] [storage] ver=1.5.3, type=memory, sync=normal, checksum=off, max_chunks_up=128
[2025/11/11 21:49:12.454355937] [ info] [simd ] SSE2
[2025/11/11 21:49:12.454363428] [ info] [cmetrics] version=1.0.5
[2025/11/11 21:49:12.454371187] [ info] [ctraces ] version=0.6.6
[2025/11/11 21:49:12.454441883] [ info] [input:tcp:tcp.0] initializing
[2025/11/11 21:49:12.454450891] [ info] [input:tcp:tcp.0] storage_strategy='memory' (memory only)
[2025/11/11 21:49:12.455168829] [ info] [sp] stream processor started
[2025/11/11 21:49:12.455347140] [ info] [output:stdout:stdout.0] worker #0 started
[2025/11/11 21:49:12.455396357] [ info] [engine] Shutdown Grace Period=5, Shutdown Input Grace Period=2
"}] tcp.0: [[1762897752.520261984, {}], {"log"=>"POST /services/collector HTTP/1.1
"}] tcp.0: [[1762897752.520272277, {}], {"log"=>"Host: 192.168.141.95:9880
"}] tcp.0: [[1762897752.520273812, {}], {"log"=>"User-Agent: OpenTelemetry Collector Contrib/11f9362e
"}] tcp.0: [[1762897752.520275124, {}], {"log"=>"Content-Length: 44970
"}] tcp.0: [[1762897752.520276343, {}], {"log"=>"Authorization: Splunk my_token
"}] tcp.0: [[1762897752.520277527, {}], {"log"=>"Connection: keep-alive
"}] tcp.0: [[1762897752.520278816, {}], {"log"=>"Content-Encoding: gzip
"}] tcp.0: [[1762897752.520280153, {}], {"log"=>"Content-Type: application/json
"}] tcp.0: [[1762897752.520281350, {}], {"log"=>"__splunk_app_name: OpenTelemetry Collector Contrib
"}] tcp.0: [[1762897752.520282527, {}], {"log"=>"__splunk_app_version:
"}]] tcp.0: [[1762897752.520283955, {}], {"log"=>"Accept-Encoding: gzip
"}]] tcp.0: [[1762897752.520285037, {}], {"log"=>"Connection: close
"}]] tcp.0: [[1762897752.520286276, {}], {"log"=>"Michael Marshall
Michael Marshall
11/11/2025, 9:52 PM
Victor Nilsson
11/12/2025, 2:02 PM
---
pipeline:
  inputs:
    - name: systemd
      tag: systemd.*
      read_from_tail: on
      threaded: true
      lowercase: on
      db: /fluent-bit/db/systemd.db
      storage.type: memory  # Filesystem buffering is not needed for tail input since the files are stored locally.
      mem_buf_limit: 250M
      alias: in_systemd
We have set db as well as read_from_tail: on, so our expectation was that the fluent-bit container should not resend already-processed logs. Is this true?
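The systemd input's db stores the last-read journal cursor, so one sanity check is to inspect it across restarts (a sketch; the table layout is an assumption and may differ across versions):

sqlite3 /fluent-bit/db/systemd.db '.tables'
# then dump the cursor row; the table name below is an assumption:
sqlite3 /fluent-bit/db/systemd.db 'SELECT * FROM in_systemd;'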
Andrew Elwell
11/13/2025, 2:31 AM
Michael Marshall
11/13/2025, 3:27 PM
DennyF
11/13/2025, 3:44 PM
Megha Aggarwal
11/13/2025, 7:18 PM
Gabriel Alacchi
11/13/2025, 10:02 PM
With storage.type=filesystem we see a rapid leak of memory in use by the fluent-bit pod in k8s, growing to as much as 16GB after a day or so without a pod restart. Fluent-bit itself is not consuming much memory, maybe a few hundred MB; rather, the kernel slab associated with the container cgroup accounts for all of the excess memory, and slabtop claims the VFS dentry cache accounts for all of those leaked kernel objects.
Since we are buffering a large number of chunks per second, we easily create hundreds of chunk files per second, which leaks dentries rather quickly. Even after a file is deleted, the kernel keeps a negative dentry caching its non-existence, and those aren't purged from the kernel cache easily unless the system is under memory pressure. More context on this topic: https://lwn.net/Articles/894098/
Is this dentry cache bloat a well-known problem in the fluent-bit community? Are there good solutions / workarounds?
Some workarounds we've considered, though we're looking for guidance from maintainers and the community:
1. Raise VFS cache pressure on the nodes (a sketch follows this list). I'm not 100% sure how much this changes VFS cache behavior here, nor what perf consequences it could have for the other workloads on the node. It's worth experimenting with.
2. Periodically reboot fluent-bit pods. This resets the pod's memory accounting, but it doesn't actually clean up the bloat in the dentry cache, since that cache is system-wide. If the system comes under memory pressure, the sheer volume of dentries could lock it up. It feels like sweeping a bigger problem under the rug.
3. Periodically migrate the fluent-bit storage directory to a new directory and delete the old one. Supposedly when a directory is deleted, a negative dentry is kept for the directory itself, but the nested entries are pruned since they are now redundant. I think this is the most plausible option, since we can wrap fluent-bit in a script that gracefully shuts it down, reconfigures, and restarts it; no code changes are required in fluent-bit itself. But how do we handle periods of backpressure when there is an existing backlog of chunks?
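A sketch of workaround 1 (the value is illustrative; the default is 100, and higher values make the kernel reclaim dentries and inodes more aggressively):

# inspect the current value
sysctl vm.vfs_cache_pressure
# raise it, e.g. to 200
sysctl -w vm.vfs_cache_pressure=200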
One idea to improve things within fluent-bit itself would be to reuse chunk file names so the cached dentries can be reused. Alternatively, store FS chunks in larger pre-allocated files with block-arena-style memory management, which may be more efficient; you can always add more files or extend the block arena if the FS storage buffer needs to grow.
CC @Pandu Aji
Rafael Martinez Guerrero
11/14/2025, 1:50 PM
Phil Wilkins
11/14/2025, 9:43 PM