steep-hospital-96238
06/18/2025, 6:29 PMcount_window()
but it requires me to assign a key to a stream which I don't want to do. Is there a magic "key" that keeps events on the same worker that they live?witty-hydrogen-53156
05/23/2025, 11:56 AMgreen-mouse-71671
05/05/2025, 12:44 PMwait_for_system_duration
. When it's 123033.200 and the trade comes at this time with event time 123033:000 it is marked as late
(because of the 100ms wait_for_system_duration
) which is not correct for my case as I still want to include such trade in the aggregate.
I can set the wait_for_system_duration
to 1min which then won't mark the trades as late but when I get to the end of the time window (123100.000) the aggregates won't be published - its gonna wait 1min to publish them which is also not what I want - I want to publish the aggregates immediately after window closes.
How to achieve this? Is it even possible with bytewax?full-ice-47894
04/29/2025, 7:04 PMOpenTelemetry trace error occurred. cannot send message to batch processor as the channel is full
What's strange is that if I change the window clock from an EventClock to a SystemClock in https://git.ligo.org/ultra-swift/shigawire/-/blob/main/shigawire/operators/__init__.py?ref_type=heads#L96-98, the error message goes away and the traces start showing up in Jaeger. Has anyone seen an issue like this before?mammoth-kitchen-79210
04/22/2025, 3:15 PMloud-fall-50710
04/19/2025, 2:57 PMwonderful-hairdresser-34729
04/15/2025, 2:07 PMthankful-morning-24927
04/15/2025, 11:24 AMable-airline-43286
04/14/2025, 7:51 PMmapped = op.stateful_map('map_op', input, my_stateful_mapper)
do i need to add any lock inside the my_stateful_mapper
?quiet-barista-67180
04/07/2025, 12:54 AMripe-umbrella-71285
04/05/2025, 6:40 PMdamp-finland-12935
04/02/2025, 6:48 AMKafkaSinkMessage
, or do you need to manually compress bytes and add the appropriate header?important-lunch-39532
03/25/2025, 11:51 PMrapid-nail-81224
03/25/2025, 3:17 AMancient-computer-86146
03/06/2025, 9:00 PMgreat-shampoo-77301
03/06/2025, 12:54 PMwonderful-hairdresser-34729
03/06/2025, 3:41 AMprocess
container in pod-0, there is no indication that bytewax api is doing anything. in the api-cache
container, it's the same story. however, for process
in pod-1, i see a bunch of failed attempts to lookup the dns name of pod-0 (which i guess is because it's waiting on pod-0 to initialize and start up, which also means the k8s Service
finally gets created and the internal k8s DNS name for that service is finally resolvable) and then radio silence. the next set of logs are from my own bytewax flow.
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
worker 1: error connecting to worker 0: failed to lookup address information: Name or service not known; retrying
mammoth-postman-28882
03/04/2025, 2:01 AMBytewax connector for X
or X connector for Bytewax
? previously i noticed only the former, but now seeing both. AIs are leaning towards the latter. so would like to confirm. some existing examples from https://github.com/bytewax org
• Bytewax connector for Rerun
• Bytewax Sink and Source Connector for InfluxDB
• Redis connector for Bytewax
mammoth-postman-28882
02/27/2025, 8:43 AMbytewax-s2
. let me know if you have any feedback on https://github.com/s2-streamstore/bytewax-s2 & https://s2.dev/docs/integrations/bytewax#real-time-insights-from-bluesky-firehose-data. before we share in our socials, would like to check a few things w/ you:
1. is there anything that is required to be done at your end before we can share?
2. what is required at our end to make S2 get listed in https://bytewax.io/modules?
3. is it cool if we tag bytewax's company page/handle when we share?damp-finland-12935
02/26/2025, 1:27 AMwonderful-hairdresser-34729
02/24/2025, 4:42 AMvalues.yaml
i can set configuration.configMap.create
= false
. but in the helm rendered statefulset, i see there is hardcoded assumption that the configmap is always mounted into the pod. suggest to change the behaviour of the helm chart so that setting configuration.configMap.create = false
would remove that hardcoded assumption.wonderful-hairdresser-34729
02/23/2025, 11:13 AMmagnificent-kite-85568
02/21/2025, 12:08 PMkeyed = op.key_on(‘url-hash’, normalized,
lambda p: str(hash(p.urls[0]) % 3))
`
4) builds batches of N messages
batch = op.collect(‘build-batch’, keyed, timeout=timedelta(milliseconds=100),
max_size=app_config.BATCH_SIZE)
5) does some heavylifing on these batches (including gpu etc)
classified = op.flat_map(‘classifier’, batch, batch_classifier.process)
6) puts the result to KafkaSinkMessage
7) sends it to kafka.
Now, the problem I am experiencing is that step 5) receives a lot of data and is able to process it - I can see from the logs that it successfully produces results. But then, when it comes to creating KafkaSinkMessages in step 6), the behavior is strange - nothing happens for few minutes, then it receives a burst of messages form like 100+ processed batches, all at one time, and huge amount of messages is being sent to kafka in step 7). Then the cycle repeats - step 5) is being executed for few minutes, the impression is that 6) and 7) are not happening when they should, and after another few minutes - again the burst. In the meantime memory consumption of the app goes through the roof and usually the app dies of OOM after few such cycles.
The question is - why is it happening like that?damp-finland-12935
02/19/2025, 2:16 AMwonderful-hairdresser-34729
02/17/2025, 11:12 PMcollect
function to accumulate messages so that I can get a numbytes and nummessages counter calculated. Is there a better way to do this? Accumulating messages is cool but it feels like it's a waste of memory when all I need is a sum of bytes and sum of message count. Ideally I would like to be able to add to message count and bytes and then throw the message away. I also want the ability to emit these two metrics once every minute but only if there's messages consumed, which is why the collect
function was chosen. I am using this against a Kafka cluster.great-shampoo-77301
02/17/2025, 8:22 AMwonderful-hairdresser-34729
02/13/2025, 9:09 AMwonderful-hairdresser-34729
02/12/2025, 9:14 AMstep_id
to make it unique. i could create a new stream with a different step_id but the old one is still running.great-shampoo-77301
02/12/2025, 8:56 AMcolossal-daybreak-85530
02/06/2025, 5:33 PM