Hacker News | jeden's comments

Ideal for DDoS ;(


I'm waiting for a new codec invented by AI.


You'll be waiting for a long time then, probably. Making codecs is actually a hard problem, the type of thing that AI completely falls over when tasked with.


Compression is actually a very good use case for neural networks (i.e. don't have an LLM develop a codec, but rather train a neural network to do the compression itself).

It works amazingly well with text compression, for example: https://bellard.org/nncp/


Considering AI is good at predicting things and that’s largely what compression does, I could see machine learning techniques being useful as a part of a codec though (which is a completely different thing from asking ChatGPT to write you a codec)
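To make the "prediction is compression" point concrete, here is a minimal sketch. It is not a neural network and not how NNCP works. It uses a simple adaptive byte-frequency model (my assumption, chosen for brevity) and adds up the ideal arithmetic-coding cost, -log2(p), of each byte. The better the predictor, the fewer bits are needed; a learned model would simply take the place of the counting model.

```python
import math


def adaptive_code_length_bits(data: bytes) -> float:
    """Ideal coding cost of `data` under an adaptive order-0 byte model.

    Each byte costs -log2(p) bits, where p is the probability the model
    assigned to that byte just before seeing it. An arithmetic coder can
    get very close to this total in practice.
    """
    counts = [1] * 256  # every byte value starts with a count of 1
    total = 256
    bits = 0.0
    for b in data:
        p = counts[b] / total
        bits += -math.log2(p)
        counts[b] += 1  # the model learns from the byte it just coded
        total += 1
    return bits


# Hypothetical sample input: highly predictable text.
sample = b"ab" * 128
raw_bits = 8 * len(sample)
model_bits = adaptive_code_length_bits(sample)
print(f"raw: {raw_bits} bits, predicted-coded: {model_bits:.0f} bits")
```

The decoder can run the same model in lockstep, so only the coded bits need to be stored or sent.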


Yeah, in the future we might use some sort of learned spatial+temporal representation to compress video, and the same for audio. It's easier to imagine for audio: instead of storing the audio samples, we store text plus some feature vectors that a model uses to "render" the audio samples.


It’s not absurd to think that you could send a model of your voice to a receiving party and then have your audio call just essentially be encoded text that gets thrown through the voice generator on the local machine.

AI video could mean that essential elements are preserved (actors?) but other elements are generated locally. Hell, digital doubles for actors could also mean only their movements are transmitted. Essentially just sending the mo-cap data. The future is gonna be weird


Yeah, I brought that up here and got some interesting responses:

> It would be interesting to see how far you could get using deepfakes as a method for video call compression.

> Train a model locally ahead of time and upload it to a server, then whenever you have a call scheduled the model is downloaded in advance by the other participants.

> Now, instead of having to send video data, you only have to send a representation of the facial movements so that the recipients can render it on their end. When the tech is a little further along, it should be possible to get good quality video using only a fraction of the bandwidth.

https://news.ycombinator.com/item?id=22907718

Specifically for voice, this was mentioned:

> A Real-Time Wideband Neural Vocoder at 1.6 Kb/S Using LPCNet

https://news.ycombinator.com/item?id=19520194
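For a sense of scale, here is a rough back-of-the-envelope calculation. The 1.6 kb/s figure is from the title quoted above. The uncompressed baseline (16 kHz, 16-bit mono PCM) is my assumption about what "wideband" speech would look like raw.

```python
# Assumed uncompressed baseline: 16 kHz, 16-bit, mono PCM.
SAMPLE_RATE_HZ = 16_000
BITS_PER_SAMPLE = 16

# Bitrate from the quoted title: 1.6 kb/s.
VOCODER_BPS = 1_600

pcm_bps = SAMPLE_RATE_HZ * BITS_PER_SAMPLE   # raw bits per second
ratio = pcm_bps / VOCODER_BPS                # how many times smaller the stream is
bytes_per_minute = VOCODER_BPS * 60 // 8     # payload for one minute of speech

print(f"raw PCM: {pcm_bps} b/s, vocoder: {VOCODER_BPS} b/s, ratio: {ratio:.0f}x")
print(f"one minute of speech: {bytes_per_minute} bytes")
```

Under those assumptions a minute of speech fits in about 12 kB, before any container or network overhead.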


In the future, our phone contacts will store name, address, phone number, voice model. (The messed up part will be that the user doesn’t necessarily send their model, but the model could be crafted from previous calls)

You could probably also transmit a low res grayscale version of the video to “map” any local reproduction to. Kinda like how a low resolution image could be reasonably reproduced if an artist knew who the subject was.


A simple program, in Polish, for labeling images for the YOLO system.

AppImage version.

Simple labels file (etykiety.txt):

0: person
1: bicycle
2: car
3: motorcycle
4: airplane
5: bus
6: train
7: truck
8: boat
9: traffic light
10: fire hydrant
11: stop sign
12: parking meter
13: bench
14: bird
15: cat
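As a rough illustration of how such labels are typically used, here is a small sketch. It assumes the labels file holds one `index: name` entry per line, and that annotations use the common YOLO text format (`class x_center y_center width height`, all normalized to the image size). The function names `parse_labels` and `to_yolo_line` are hypothetical and are not taken from the program itself.

```python
def parse_labels(text: str) -> dict[int, str]:
    """Parse 'index: name' lines into a {class_id: name} mapping."""
    labels = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        idx, name = line.split(":", 1)
        labels[int(idx)] = name.strip()
    return labels


def to_yolo_line(class_id: int, box: tuple, img_w: int, img_h: int) -> str:
    """Convert a pixel box (x_min, y_min, x_max, y_max) to a YOLO label line."""
    x_min, y_min, x_max, y_max = box
    x_center = (x_min + x_max) / 2 / img_w
    y_center = (y_min + y_max) / 2 / img_h
    width = (x_max - x_min) / img_w
    height = (y_max - y_min) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


labels = parse_labels("0: person\n1: bicycle\n2: car\n")
# A car occupying pixels (100, 50) to (300, 250) in a 400x500 image.
line = to_yolo_line(2, (100, 50, 300, 250), img_w=400, img_h=500)
print(labels[2], "->", line)
```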


Any examples? I'm looking for wasm without JS.


I noticed this question yesterday, which includes the changed build options for freestanding.

https://ziggit.dev/t/wasm-index-out-of-bounds-only-in-debug-...



Why does the Rust compiler create such a BIG executable?


Please compile your egui program and check:

valgrind --leak-check=full --show-reachable=yes --track-origins=yes -s ./your_program

Is there a memory leak?


The EU is planning to open 169 MHz for ISM (free) communications, but the trouble is distance. This band is world


;(

* no 802.11s (s == mesh), which is the best choice today
* no mechanical keyboard?
* I prefer a Chocolate / Planck layout
* fat, still too fat

options

* I need power, mobile power for the computer. Why not add 2-3 x 18650 cells as a UPS or for emergency use?


or Crystal ;)

(or mrubyc)


https://notes.eatonphil.com/lua-in-rust.html It's some kind of developer trend.

