Show HN: Compile C to Not Gates

github.com

142 points by tomhee 5 days ago

Hi! I've been working on the flipjump project, a programming language with 1 opcode: flip (invert) a bit, then jump (unconditionally). So a bit-flip followed by more bit-flips. It's effectively a bunch of NOT gates. This language, as poor as it sounds, is RICH.

Today I completed my compiler from C to FlipJump. It takes C files, and compiles them into flipjump. I finished testing it all today, and it works! My key interest in this project is to stretch what we know of computing and to prove that anything can be done even with minimal power.

I appreciate you reading my announcement, and be happy to answer questions.

More links:

- The flipjump language: https://github.com/tomhea/flip-jump https://esolangs.org/wiki/FlipJump

- c2fj python package https://pypi.org/project/c2fj/

bangaladore 5 days ago

Reminds me of movfuscator [1]. This can compile programs to movs and only movs.

[1] https://github.com/Battelle/movfuscator

  • LPisGood 4 days ago

    Battelle is great. They also created some software called Cantor Dust [1] that turns files into images to allow humans to easily spot obfuscated data or files.

    The sad thing about this kind of work, because I love it, is that to get paid to do it you need clearances and polygraphs and periodic reinvestigations/continuous monitoring and all sorts of things that I find unpleasant.

    [1] https://github.com/Battelle/cantordust

    • mmastrac 4 days ago

      I'm not sure what you mean but I was a security researcher for a large company for a bit and required none of that. I was required to work airgapped at home, however.

      • LPisGood 4 days ago

        Really? You were doing offensive security work not for a government (/contractor)? What sorts companies, aside from some enterprise pen testers, employ these roles?

        • saagarjha 4 days ago

          The tools you’re talking about are not exclusive to offensive security. They’re plenty useful for malware analysis and other reverse engineering tasks.

        • mmastrac 4 days ago

          Email is in my profile -- happy to clarify/share some very rough details if you'd like.

  • beng-nl 4 days ago

    Agreed that is a fine piece of work. But the author is Chris Domas. Which is plain from the repo readme, but it’d be clearer to link to his repo.

tromp 4 days ago

Am I right in deducing that this language gets its power from self-modifying code? I.e. flipping bits within addresses of the opcodes of the running program?

  • tomhee 4 days ago

    You are indeed right

    • tromp 4 days ago

      I would have expected the language documentation to focus more on this observation and to explain for instance how self modification is used to implement while loops. But I don't even see the term mentioned anywhere?!

Firehawke 4 days ago

Wouldn't it be better to call it "compile C to Linux or BSD"?

I kid, I kid.

pizza 4 days ago

Ah interesting.. wonder if you can model this with a recursively expanded algebraic expression. I've been thinking lately along similar lines about polynomials that encode pushdown automata, so this is cool to see.

  • tomhee 4 days ago

    If you have an answer I'd be happy to hear it!

dingdingdang 4 days ago

It always amazes me that this is possible (to some extend anyway, I mean, the base layer is binary so obviously simpler higher-end CPU instructions are possible!)

Is there any potential performance win in this? What I mean is; since this general direction could, in principle if not in practise, enable the targeting of say, the 5-10 most efficient CPU instructions rather than attempting to use the whole surface area... would this potentially be a win?

eimrine 4 days ago

I was expecting to see a way to translate hello_world.c into an electronic schematic full of NAND elements, kind of Mealy machine.

tonetegeatinst 4 days ago

Looking forward to the poor security researcher who gets to reverse engineer some malware sample they compiles this into for obfuscation... Its going to be an interesting blog post.

tomhee 4 days ago

By the way, as a challenge, try how you can program an "If" statement in Flipjump.

  • greenbit 4 days ago

    I wondered this as well.

    Thinking about it, if all you have is "invert some (N>1?) bits somewhere and jump to somewhere" .. I could see maybe it might work if you use self modifying code and there's really a 2nd instruction that is a no-op? Seems like it might work more like a cellular automata?

    Of course, one could just go look at the documentation, but where's the fun in that?

    • int_19h 3 days ago

      You don't need a no-op; you can always just flip a bit you don't care about (e.g. reserve a word just for that) and then jump to next instruction.

jkrshnmenon 4 days ago

I wonder if someone has already made a Reverse Engineering CTF challenge for this concept.

jvanderbot 4 days ago

Is the family of circuits using just NOT gates actually universal? Or is "flip" and "jump" secretly using a lot of other gates.

  • tomhee 4 days ago

    The power is within the self modification of the code. The jump might be implemented by a multiplexer, though it can be implemented in other ways too.

    • jvanderbot 4 days ago

      A CNOT is universal (transistor effectively) I don't think a NOT is universal.

      I'm sure you can self modify your code so it executes only using XOR (bit flips), which is a CNOT, but I do not think this could be compiled down to an FPGA using only a billion not gates.

      Actually I just convinced myself you can make an AND from three NOT gates if you can tie outputs together to get OR, so I believe you now. Sorry for the diversion! (Though I still dont see how bit flips and jumps directly can be built into a circuit, I know AND and NOT are universal so it's all good).

dlcarrier 4 days ago

Maxim (now owned by Analog) actually manufactures a single-instruction processor series, called MAXQ. It uses a single move instruction, with a flag for literals, and a transport triggered architecture.

  • Zamiel_Snawley 4 days ago

    What is the intended use case for such a processor?

    • dlcarrier 3 days ago

      They are embedded microcontrollers, which run real-time deterministic tasks, with tens to hundreds of MIPS on fixed-point tasks. These are the kinds of microcontrollers used in products like household appliances or control systems.

Imustaskforhelp 4 days ago

hey this could actually be pretty nice if we can convert flipjump into sqlite native instructions like how it is possible for brainfuck , then you are on to something huge!

You would create although highly inefficient , after many years , maybe the first , language like those lisps where you could store data in sqlite and run it fromt there (but with C)

Nevermnd 4 days ago

Did I miss something? I thought NAND was you're 'universal gate' ?

artemonster 4 days ago

Id appreciate more explanations from the power of combined bitflip & goto

platz 4 days ago

How is a jump realized by Not Gates?

  • tomhee 4 days ago

    I dont think that the jump can be realized by NOT gates, but it's essentially "where to find the next NOT command". The jump is indeed a crucial part of the language, as it allows going back, and especially to make self-modifying code.

  • Jerrrry 4 days ago

    I'm guessing by not jumping into a terminating/ halting NOOP.

    The logic is within the branching.

dang 5 days ago

Looks like we banned you and this domain because of the egregious vote manipulation and bogus comments at https://news.ycombinator.com/item?id=34856792.

That was a long time ago, though, and the project is interesting enough, so I'm going to assume you've learned your lesson and unban you. Please stop using multiple accounts for this though!

  • tomhee 5 days ago

    Thanks man, I appreciate it.

  • jimbob45 5 days ago

    Dang, I have to know what triggered you to say this. It’s not the same user account so you would have had to have recognized the URL and written based on that.

    Do you keep notes on each astroturfed submission and auto-trigger reposts to notify yourself? Or did you just happen to recognize this? 20 minutes from his post to your comment is absurdly good moderation.