OCR Error Correction, Take 2
Last week I wrote about OCR error correction and the three general approaches to it. Over the week, I got practical and fixed OCR errors in my dataset. However, I used a fourth approach, which is much more efficient and fixed about 80% of all OCR errors. Read more on my progress!
On the State of OCR Correction
If you have ever worked with text, you will know the huge pain that OCR (Optical Character Recognition) errors can pose. OCR has one job: Detect all the letters and numbers on an image and spit out a text file that contains the text that has been recognized. However, OCR errors still occur frequently. Here I outline what I've learned so far in this regard.
Afghanistan has fallen
Every generation has their own catastrophic crisis to bear. Be it the French revolution, the two World Wars, Vietnam, the collapse of the Soviet Union or the 2008 financial crisis. For me and my generation, it was Afghanistan. And now, after twenty years – my whole adult life –, all of this is over. Afghanistan is lost, and with it all hope that, after more than fourty years, Afghanistan may return to its pre-war peace.
UX Design is not for Creative People
One of the biggest concerns I had over the past four years of developing Zettlr was that I'm not a designer by trade and apart from some obvious rules I know nothing about cool design stuff. In fact, all my furniture is from Ikea and the only art I have in my flat are some mass-produced band posters. That's how non-designery I am. But recently, I have come to the conclusion that this might not be as detrimental for Zettlr as I always thought.
The Quiet Error that almost cost me my Sanity
I'm attending a summer school right now, so I don't have that much time to write up something awesome. But yesterday I finally resolved a very quiet error that had almost cost me my sanity. It occurred almost a year ago, and I wasn't able to fix it until yesterday. So, come and read a short story about what can go wrong if you forget what an INI-file is supposed to look like.
Telemetry, Data Privacy, and Zettlr
A few days ago, a scandal erupted in the Open Source Community: After being acquired by the Muse group, the free audio editor Audacity was supposed to receive an update that would include telemetry in the app. And users didn't like that. So today I want to talk about telemetry, what it is, and how the Audacity debacle prompted me to finally purge any form of data transmission from Zettlr myself.
Update: Setting up Python, numpy, and PyTorch natively on Apple M1
A few months ago I received my first MacBook with an M1 chip – a.k.a.: It uses the ARM-architecture instead of the default Intel-architecture that most modern computers use. Back then I told you that installing PyTorch is a pain; and back then it was because the ecosystem wasn't fully adapted to the fact that suddenly MacBooks could also have ARM instruction sets. However, in the past six months, lots has changed. As I'm getting more and more emails by people asking whether or not my old instructions still hold true, I've decided it's time for an update.
On Electron, the Bloated Web, and Trade-Offs
After a short discussion on the Zettlr issue tracker about possibilities of making the app portable, I began a quick Google search about how to actually do that. Instead of finding ways of creating a portable version of an Electron app, I found a rabbit hole. And it was a rabbit hole I know by heart, because I've followed it down many, many times. It's a story about the bloat of the web and why Electron is bad. Here I ponder about the thought of efficiency, and why certain trade-offs are sometimes beneficial.
Bitwise Flags are Beautiful, and Here’s Why
This is an article I've been wanting to write for a long time. Programming and writing code is often about efficiency: It has to work, and that's it. But there is more to code. Sometimes, it can be very beautiful, and there are a few patterns and capabilities of code that make it almost like art. Today I want to devote a full article to one such concept: Bitwise flags.
I stopped Measuring my Time, and you should too
We live in a very data-centric world and like to measure everything, from the steps we take each day to the time we spend online. While measuring steps can simply prompt you to walk more (especially during a pandemic where it’s normal to not leave the house on many days) sometimes measuring can have detrimental results. I had one of these experiences just recently, and therefore would like to warn you sometimes not to measure yourself.