Commit 46d8616
[pkg/stanza/fileconsumer] Optimize indexed fingerprint matching (#47660)
This PR was implemented as part of #45203 but decided to split the
changes done to make them easier to review.
Updates
#27404
Changes fingerprint matching in `fileset`. Before this change, matching
was done with linear scans over the tracked files. After this change,
matching uses an internal index based on fingerprint length and
fingerprint content, so exact and prefix matches no longer depend on
scanning the full set every time.
As part of that implementation, `Fileset.Pop()` changes from removing
the first element to removing the last element. In other words, it
changes from FIFO behavior to LIFO behavior. This is an internal change
that makes removal O(1) and fits the indexed structure.
The benchmark was run before and after the change with `go test
./fileconsumer -run '^$' -bench BenchmarkPollManyFiles -benchmem
-count=5`.
| Files watched | Before | After | Improvement |
| --- | ---: | ---: | ---: |
| 100 | 2.448 ms/op | 2.196 ms/op | 10.3% |
| 500 | 12.27 ms/op | 10.77 ms/op | 12.3% |
| 1000 | 28.20 ms/op | 21.91 ms/op | 22.3% |
| 2000 | 64.59 ms/op | 44.90 ms/op | 30.5% |
| 2500 | 87.65 ms/op | 57.22 ms/op | 34.7% |
| 3000 | 112.98 ms/op | 69.61 ms/op | 38.4% |
| Geomean | 28.55 ms/op | 21.27 ms/op | 25.5% |
---------
Signed-off-by: Israel Blancas <iblancasa@gmail.com>
Co-authored-by: Andrzej Stencel <andrzej.stencel@elastic.co>1 parent defe51c commit 46d8616
11 files changed
Lines changed: 516 additions & 38 deletions
File tree
- .chloggen
- pkg/stanza/fileconsumer
- internal
- archive
- fileset
- fingerprint
- tracker
Lines changed: 23 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
342 | 342 | | |
343 | 343 | | |
344 | 344 | | |
| 345 | + | |
| 346 | + | |
| 347 | + | |
| 348 | + | |
| 349 | + | |
| 350 | + | |
| 351 | + | |
| 352 | + | |
| 353 | + | |
| 354 | + | |
| 355 | + | |
| 356 | + | |
| 357 | + | |
| 358 | + | |
| 359 | + | |
| 360 | + | |
| 361 | + | |
| 362 | + | |
| 363 | + | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
| 367 | + | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
345 | 389 | | |
346 | 390 | | |
347 | 391 | | |
| |||
411 | 455 | | |
412 | 456 | | |
413 | 457 | | |
414 | | - | |
415 | | - | |
| 458 | + | |
| 459 | + | |
416 | 460 | | |
417 | 461 | | |
418 | 462 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
93 | 93 | | |
94 | 94 | | |
95 | 95 | | |
96 | | - | |
| 96 | + | |
97 | 97 | | |
98 | 98 | | |
99 | 99 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
25 | 34 | | |
26 | | - | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
27 | 39 | | |
28 | 40 | | |
29 | 41 | | |
30 | | - | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
31 | 48 | | |
32 | 49 | | |
33 | 50 | | |
| |||
38 | 55 | | |
39 | 56 | | |
40 | 57 | | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
41 | 91 | | |
42 | | - | |
43 | 92 | | |
44 | 93 | | |
45 | 94 | | |
46 | 95 | | |
47 | | - | |
48 | | - | |
49 | | - | |
| 96 | + | |
50 | 97 | | |
51 | 98 | | |
52 | 99 | | |
53 | | - | |
54 | | - | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
55 | 157 | | |
56 | 158 | | |
57 | 159 | | |
58 | 160 | | |
59 | 161 | | |
60 | 162 | | |
61 | | - | |
62 | | - | |
| 163 | + | |
63 | 164 | | |
64 | 165 | | |
65 | 166 | | |
66 | 167 | | |
67 | 168 | | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
| 177 | + | |
| 178 | + | |
| 179 | + | |
| 180 | + | |
| 181 | + | |
| 182 | + | |
| 183 | + | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
| 229 | + | |
| 230 | + | |
| 231 | + | |
| 232 | + | |
| 233 | + | |
| 234 | + | |
| 235 | + | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
68 | 240 | | |
69 | 241 | | |
70 | 242 | | |
| |||
0 commit comments