Fluent bit stops uploading logs when there are too many open chunks #3541
Comments
To Reproduce
Seems to be the same as #3338…
This is affecting GCP customer(s).
Hello @MPeli I am following the reproducer procedure outlined in your last comment but on linux using the latest master build. So far the issue isn't reproducible, can you please review the following procedure and
$ ls -lh /data/*.xlog.log | wc -l
120
$ du -ach /data/*.xlog.log | grep -i total
606M total
2.1 set storage.max_chunks_up 32
$ docker-compose logs fluentbit | grep -i chunks
fluentbit | [2021/06/09 00:16:38] [ info] [storage] normal synchronization mode, checksum disabled, max_chunks_up=32
[SERVICE]
    Flush                 5.000000
    Daemon                Off
    Log_Level             Trace
    storage.max_chunks_up 32
[INPUT]
    Name              tail
    Mem_Buf_limit     95.4M
    Path              /data/*.xlog.log
    Multiline         off
    Parser_Firstline  xlog_parser
    Path_Key          path
    Offset_Key        1
    DB                fluent.db
    Read_from_Head    On
    Buffer_Chunk_Size 128k
    Buffer_Max_Size   256k
    Ignore_Older      10d
[OUTPUT]
    Name  es
    Match *
    Host  elasticsearch-master
    Port  9200
    Index fluentbit
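For anyone repeating the check above on another machine, the file count and total size can be gathered in one step. This is only a sketch: `summarize_logs` is a hypothetical helper name, not part of Fluent Bit, and the default pattern simply mirrors the `Path` used in this config. It counts bytes with `wc -c` rather than parsing `ls` or `du` output, and it assumes file names do not contain newlines.

```shell
#!/bin/sh
# Hypothetical helper (not part of Fluent Bit): summarise the files that a
# tail input with "Path DIR/PATTERN" would match.
# Usage: summarize_logs DIR [PATTERN]
summarize_logs() {
    dir=$1
    pattern=${2:-*.xlog.log}   # assumption: same pattern as the config above

    # Count regular files directly inside DIR that match PATTERN.
    count=$(find "$dir" -maxdepth 1 -type f -name "$pattern" | wc -l | tr -d ' ')

    # Sum the sizes in bytes by concatenating the files and counting bytes.
    bytes=$(find "$dir" -maxdepth 1 -type f -name "$pattern" -exec cat {} + | wc -c | tr -d ' ')

    echo "files=$count bytes=$bytes"
}
```

Calling `summarize_logs /data` prints a single `files=... bytes=...` line that can be pasted into a comment alongside the `max_chunks_up` value.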
We observed the same issue with 1.7.6, but it was fixed by 1.7.8. Please give that a try.
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
This issue was closed because it has been stalled for 5 days with no activity.
This doesn't seem to be fixed in 1.7.8 or 1.8.4. I tested a custom build of 1.8.4 w/ chunkio 1.1.5 on Windows 2016, and when I forced it to lose connection (by breaking the DNS resolver), CPU usage went way up, the logs started filling with errors, and FB kept restarting. Some examples of logs:
After a bit, started to see Links
Steps to reproduce
Also a while ago I tested this on Fluent Bit 1.7.8 (via Ops Agent 2.0.3) and it failed in the same way.
Bug Report
Describe the bug
Fluent bit stops uploading logs when there are too many open chunks.
To Reproduce
Not really sure how. I will add more details once I find out more. It happens on a server where there are 1500 log files in a folder (not sure if it matters).
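One way to test whether the number of files matters is to generate a folder with a comparable file count and point a tail input at it. The following is a minimal sketch under stated assumptions: `make_logs` is a hypothetical helper, the `app-N.xlog.log` file names are made up, and the line content is a placeholder that does not match any real parser format.

```shell
#!/bin/sh
# Hypothetical generator: create COUNT log files with LINES_PER_FILE lines each.
# File names and line content are placeholders, not taken from the report.
# Usage: make_logs DIR COUNT [LINES_PER_FILE]
make_logs() {
    dir=$1
    count=$2
    per_file=${3:-100}

    mkdir -p "$dir" || return 1

    i=1
    while [ "$i" -le "$count" ]; do
        j=1
        # Write all lines for one file through a single redirection.
        while [ "$j" -le "$per_file" ]; do
            printf 'placeholder log line %d of file %d\n' "$j" "$i"
            j=$((j + 1))
        done > "$dir/app-$i.xlog.log"
        i=$((i + 1))
    done
}
```

For example, `make_logs ./data 1500 100` creates 1500 files of 100 lines each, roughly matching the file count mentioned in the report. Whether this actually triggers the problem is untested here.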
Expected behavior
Fluent Bit should not get stuck; it should try to reconnect.
Your Environment
fluent-bit-too-many-open-chunks.log