VeejayRampay · 2025-10-22T20:18:15 1761164295

thanks for this post

it should be repeated ad nauseam that he is a crook, a shame for the country and its values, and that the whole discourse about the injustice of the sentencing has heavy anti-liberal vibes

VeejayRampay · 2025-10-17T07:13:27 1760685207

you're not in the minority, there's just intense fanboyism on Hacker News to promote OpenAI, because it serves the whole "LLM revolution" schtick better

Gemini has been dominating the field for about a year now, but I suppose Google is a bit boring because they just do things well

VeejayRampay · 2025-10-13T12:54:30 1760360070

due to the nature of PDF, none of the tools mentioned here can do things as simple as detecting tables on pages with high accuracy

PDF is absolutely mint for display but it really suffers when parsing is involved

WillAdams · 2025-10-13T13:54:39 1760363679

Yeah, I've been expecting someone to work up a system where:

- source file is .md

- file is compiled to .pdf _and_ the .md source file is included as an attachment

- when working with the file beyond viewing as a .pdf the .md is extracted and used instead of the .pdf

The LaTeX folks had a similar system ages ago where the .tex source would be included in a .pdf made from a .tex file for embedding in documents so that it could be sent in say an e-mail and then edited by the recipient --- absolutely awesome for discussing math via e-mail.

apf6 · 2025-10-13T14:31:38 1760365898

That's a good concept but I don't think Markdown is expressive enough for all the layouts & formatting that people typically want in PDFs. More likely that the source format would be something like HTML or SVG or .docx.

kevin_thibedeau · 2025-10-13T15:16:14 1760368574

Restructured text has mostly 1:1 correspondence with Docbook. I use an XSLT transform to convert its XML schema into Docbook and PDF from there via XSL-FO.

VeejayRampay · 2025-10-01T20:59:37 1759352377

python will be the last man standing with basically no functional goodies

people will keep on trucking with their "pythonic" for loops containing appends to a list that was initialized right before, their disgust for recursion, their absence of lambdas, the lack of monadic return types or containers

jghn · 2025-10-01T22:19:22 1759357162

> with basically no functional goodies

Python has had `map` and friends for well over 20 years. Also see the built-in `functools` module.

skirmish · 2025-10-02T07:10:38 1759389038

List, dict and set comprehensions and generators are used quite a lot in Python and feel very functional to me.
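
For anyone following along, here is a rough sketch of the styles being argued about in this subthread: the append-in-a-loop pattern next to a comprehension, `map`/`filter` with lambdas, `functools.reduce`, and a generator expression. The data and variable names are made up for illustration and don't come from any of the comments above.

```python
from functools import reduce

# Hypothetical sample data, chosen only for illustration.
numbers = [1, 2, 3, 4, 5, 6]

# Imperative style: initialize a list, then append inside a for loop.
squares_loop = []
for n in numbers:
    if n % 2 == 0:
        squares_loop.append(n * n)

# List comprehension: same result in a single expression.
squares_comp = [n * n for n in numbers if n % 2 == 0]

# map/filter with lambdas: the "map and friends" built-ins.
squares_map = list(map(lambda n: n * n, filter(lambda n: n % 2 == 0, numbers)))

# functools.reduce: fold the squares into a single sum.
total_reduce = reduce(lambda acc, n: acc + n, squares_comp, 0)

# Generator expression: computed lazily, no intermediate list is built.
total_gen = sum(n * n for n in numbers if n % 2 == 0)
```

All three list-building variants produce the same list, and both totals agree; which one reads best is mostly a matter of taste.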

VeejayRampay · 2025-09-22T11:41:45 1758541305

remotely related, but I have yet to find a reliable solution for page classification in a document for tables, i.e. a classifier that returns the indices of the pages in a document that contain tables

solutions using things like img2table or pymupdf are really bad (pymupdf is not even reliable for text pdfs)

djoldman · 2025-09-22T15:53:59 1758556439

In my experience, this task is incredibly difficult to solve in general.

Handcrafting based on the dataset is the only way to get high performance.

VeejayRampay · 2025-09-17T15:25:23 1758122723

for context, the author is Aaron Patterson of Ruby and Ruby on Rails fame, a proficient C programmer and overall hacker, he knows his stuff

VeejayRampay · 2025-09-14T14:29:52 1757860192

it's funny to observe how picky and cynical the HN crowd suddenly becomes when the disruptive technology is from china

bastawhiz · 2025-09-14T19:45:00 1757879100

What part of this is disruptive? It kind of has to work well to be disruptive, doesn't it?

ramon156 · 2025-09-14T15:38:10 1757864290

You can't be critical anymore?

izabera · 2025-09-14T22:49:03 1757890143

deepseek is from china and all their papers have been very well received

VeejayRampay · 2025-09-14T07:52:13 1757836333

160 comments on this, OpenAI is definitely not the hype anymore

VeejayRampay · 2025-08-22T16:51:12 1755881472

it is pedantic, everyone knows what "node" means in this context

genshii · 2025-08-22T17:22:17 1755883337

Apparently not, because I first assumed that he was talking about TypeScript, considering that JavaScript doesn't have much of a type system to compare to.

VeejayRampay · 2025-08-22T13:26:57 1755869217

and speed, it's way way faster