Meme transcription:

Panel 1: Bilbo Baggins ponders, “After all… why should I care about the difference between int and String?

Panel 2: Bilbo Baggins is revealed to be an API developer. He continues, “JSON is always String, anyways…”

  • brian@programming.dev
    link
    fedilink
    arrow-up
    3
    ·
    2 hours ago

    json doesn’t have ints, it has Numbers, which are ieee754 floats. if you want to precisely store the full range of a 64 bit int (anything larger than 2^53 -1) then string is indeed the correct type

      • bleistift2@sopuli.xyzOP
        link
        fedilink
        English
        arrow-up
        14
        ·
        23 hours ago

        Or even funnier: It gets parsed in octal, which does yield a valid zip code. Good luck finding that.

        • raman_klogius@ani.social
          link
          fedilink
          English
          arrow-up
          3
          ·
          4 hours ago

          Who tf decided that a 0 prefix means base 8 in the first place? If a time machine was invented somehow I’m going to cap that man, after the guy that created JavaScript.

        • kamen@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          7 hours ago

          Oof.

          I guess this is one of the reasons that some linters now scream if you don’t provide base when parsing numbers. But then again good luck finding it if it happens internally. Still, I feel like a ZIP should be treated as a string even if it looks like a number.

          • bitfucker@programming.dev
            link
            fedilink
            arrow-up
            3
            ·
            3 hours ago

            Yep. Much like we don’t treat phone numbers like a number. The rule of thumb is that if you don’t do any arithmetic with it, it is not a “number” but numeric.

          • bleistift2@sopuli.xyzOP
            link
            fedilink
            English
            arrow-up
            3
            ·
            22 hours ago

            I’m not sure if you’re getting it, so I’ll explain just in case.

            In computer science a few conventions have emerged on how numbers should be interpreted, depending on how they start:

            • decimal (the usual system with digits from 0 to 9): no prefix
            • binary (digits 0 and 1): prefix 0b, so 0b1001110
            • octal (digits 0 through 7): prefix 0, so 0116
            • hexadecimal (digits 0 through 9 and then A through E): prefix 0x, so 0x8E

            If your zip code starts with 9, it won’t be interpreted as octal. You’re fine.

            • xthexder@l.sw0.com
              link
              fedilink
              arrow-up
              7
              ·
              edit-2
              21 hours ago

              Well, you’re right. I wasn’t getting it, but I’ve also never seen any piece of software that would treat a single leading zero as octal. That’s just a recipe for disaster, and it should use 0o116 to be unambiguous

              (I am a software engineer, but was assuming you meant it was hardcoded to parse as octal, not some weird auto-detect)

                • xthexder@l.sw0.com
                  link
                  fedilink
                  arrow-up
                  5
                  ·
                  edit-2
                  19 hours ago

                  Interesting that strtol in C does that. I’ve always explicitly passed in base 10 or 16, but I didn’t know it would auto-detect if you passed 0. TIL.

              • Doc Avid Mornington@midwest.social
                link
                fedilink
                English
                arrow-up
                1
                ·
                20 hours ago

                It’s been a long time, but I’m pretty sure C treats a leading zero as octal in source code. PHP and Node definitely do. Yes, it’s a bad convention. It’s much worse if that’s being done by a runtime function that parses user input, though. I’m pretty sure I’ve seen that somewhere in the past, but no idea where. Doesn’t seem likely to be common.

  • RustyNova@lemmy.world
    link
    fedilink
    arrow-up
    103
    arrow-down
    1
    ·
    edit-2
    2 days ago

    To whoever does that, I hope that there is a special place in hell where they force you to do type safe API bindings for a JSON API, and every time you use the wrong type for a value, they cave your skull in.

    Sincerely, a frustrated Rust dev

    • Aux@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      17 hours ago

      Well, apart from float numbers and booleans, all other types can only be represented by a string in JSON. Date with timezone? String. BigNumber/Decimal? String. Enum? String. Everything is a string in JSON, so why bother?

      • RustyNova@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        8 hours ago

        I got nothing against other types. Just numbers/misleading types.

        Although, enum variants shall have a label field for identification if they aren’t automatically inferable.

        • Aux@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          8 hours ago

          Well, the issue is that JSON is based on JS types, but other languages can interpret the values in different ways. For example, Rust can interpret a number as a 64 bit int, but JS will always interpret a number as a double. So you cannot rely on numbers to represent data correctly between systems you don’t control or systems written in different languages.

    • Mubelotix@jlai.lu
      cake
      link
      fedilink
      arrow-up
      3
      ·
      1 day ago

      You HAVE to. I am a Rust dev too and I’m telling you, if you don’t convert numbers to strings in json, browsers are going to overflow them and you will have incomprehensible bugs. Json can only be trusted when serde is used on both ends

      • RustyNova@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        8 hours ago

        This is understandable in that use case. But it’s not everyday that you deal with values in the range of overflows. So I mostly assumed this is fine in that use case.

    • Carighan Maconar@lemmy.world
      link
      fedilink
      arrow-up
      5
      arrow-down
      1
      ·
      1 day ago

      Relax, it’s just JSON. If you wanted to not be stringly-typed, you’d have not used JSON.

      (though to be fair, I hate it when people do bullshit types, but they got a point in that you ought to not use JSON in the first place if it matters)

      • RustyNova@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        1 day ago

        As if I had a choice. Most of the time I’m only on the receiving end, not the sending end. I can’t just magically use something else when that something else doesn’t exist.

        Heck, even when I’m on the sending end, I’d use JSON. Just not bullshit ones. It’s not complicated to only have static types, or having discriminant fields

    • skuzz@discuss.tchncs.de
      link
      fedilink
      arrow-up
      35
      ·
      2 days ago

      “Hey, it appears to be int most of the time except that one time it has letters.”

      throws keyboard in trash

      • Username@feddit.de
        link
        fedilink
        arrow-up
        15
        ·
        2 days ago

        Rust has perfectly fine tools to deal with such issues, namely enums. Of course that cascades through every bit of related code and is a major pain.

        • RustyNova@lemmy.world
          link
          fedilink
          arrow-up
          23
          ·
          2 days ago

          Sadly it doesn’t fix the bad documentation problem. I often don’t care that a field is special and either give a string or number. This is fine.

          What is not fine, and which should sentence you to eternal punishment, is to not clearly document it.

          Don’t you love when you publish a crate, have tested it on thousands of returned objects, only for the first issue be “field is sometimes null/other type?”. You really start questioning everything about the API, and sometimes you’d rather parse it as serde::Value and call it a day.

    • Rednax@lemmy.world
      link
      fedilink
      arrow-up
      11
      arrow-down
      3
      ·
      2 days ago

      The worst thing is: you can’t even put an int in a json file. Only doubles. For most people that is fine, since a double can function as a 32 bit int. But not when you are using 64 bit identifiers or timestamps.

      • Ethan@programming.dev
        link
        fedilink
        English
        arrow-up
        30
        ·
        2 days ago

        That’s an artifact of JavaScript, not JSON. The JSON spec states that numbers are a sequence of digits with up to one decimal point. Implementations are not obligated to decode numbers as floating point. Go will happily decode into a 64-bit int, or into an arbitrary precision number.

        • Aux@lemmy.world
          link
          fedilink
          arrow-up
          1
          arrow-down
          1
          ·
          17 hours ago

          What that means is that you cannot rely on numbers in JSON. Just use strings.

          • JackbyDev@programming.dev
            link
            fedilink
            English
            arrow-up
            1
            ·
            3 hours ago

            Unless you’re dealing with some insanely flexible schema, you should be able to know what kind of number (int, double, and so on) a field should contain when deserializing a number field in JSON. Using a string does not provide any benefits here unless there’s some big in your deserialzation process.

            • Aux@lemmy.world
              link
              fedilink
              arrow-up
              1
              ·
              1 hour ago

              What’s the point of your schema if the receiving end is JavaScript, for example? You can convert a string to BigNumber, but you’ll get wrong data if you’re sending a number.

    • bleistift2@sopuli.xyzOP
      link
      fedilink
      English
      arrow-up
      3
      ·
      22 hours ago

      Hell, no. If I wanted to save bytes, I’d use a binary format, or just fucking zip the JSON. Looking at a request-response pair and quickly understanding the transferred data is invaluable.

    • 0x0@programming.dev
      link
      fedilink
      arrow-up
      10
      arrow-down
      1
      ·
      1 day ago

      If there are no humans in the loop, sure, like for data transfer. But for, e.g., configuration files, i’d prefer a text-based solution instead of a binary one, JSON is a nice fit.

        • bitfucker@programming.dev
          link
          fedilink
          arrow-up
          1
          ·
          3 hours ago

          Until someone cannot tell the difference between tab and space when configuring or you miss one indentation. Seriously, whoever thinks indentation should have semantic meaning for computers should burn in hell. Indentation is for us, humans, not computers. You can write a JSON with or without indentation if you want. Also, use JSON5 to have comments and other good stuff for a config file.

        • JackbyDev@programming.dev
          link
          fedilink
          English
          arrow-up
          1
          ·
          3 hours ago

          It’s entirely disingenuous because who the hell is throwing JSON into YAML without converting it? Oh wow, I changed the file extension and it still works. I’m so glad we changed to YAML!

        • bob_lemon@feddit.org
          link
          fedilink
          arrow-up
          2
          ·
          18 hours ago

          Yaml is just arcane bullshit to actually write as a human. Nor is it intuitively clear how yaml serializes.

      • frezik@midwest.social
        link
        fedilink
        arrow-up
        1
        ·
        1 day ago

        What I’d like for a configuration language is a parser that can handle in-place editing while maintaining whitespace, comments, etc. That way, automatic updates don’t clobber stuff the user put there, or (alternatively) have sections of ### AUTOMATIC GENERATION DO NOT CHANGE###.

        You need a parser that handles changes on its own while maintaining an internal representation. Something like XML DOM (though not necessarily that exact API). There’s a handful out there, but they’re not widespread, and not on every language.

          • frezik@midwest.social
            link
            fedilink
            arrow-up
            1
            ·
            3 hours ago

            Is a very good idea providing much needed fixes to the JSON spec, but isn’t really what I’m getting at. Handling automatic updates in place is a software issue, and could be done on the older spec.

            • bitfucker@programming.dev
              link
              fedilink
              arrow-up
              2
              ·
              3 hours ago

              Hmm, maybe I am missing the point. What exactly do you mean by handling automatic updates in place? Like, the program that requires and parses the config file is watching for changes to the config file?

              • frezik@midwest.social
                link
                fedilink
                arrow-up
                2
                ·
                edit-2
                2 hours ago

                As an example, Klipper (for running 3d printers) can update its configuration file directly when doing certain automatic calibration processes. The z-offset for between a BLtouch bed sensor and the head, for example. If you were to save it, you might end up with something like this:

                [bltouch]
                z_offset: 3.020
                ...
                #*# <---------------------- SAVE_CONFIG ---------------------->
                #*# DO NOT EDIT THIS BLOCK OR BELOW. The contents are auto-generated.
                #*#
                [bltouch]
                z_offset: 2.950
                

                Thus overriding the value that had been set before, but now you have two entries for the same thing. (IIRC, Klipper does comment out the original value, as well.)

                What I’d want is an interface where you can modify in place without these silly save blocks. For example:

                let conf = get_config()
                conf.set( 'bltouch.z_offset', 2.950 )
                conf.add_comment_after( 'bltouch.z_offset', 'Automatically generated' )
                conf.save_config()
                

                Since we’re declaratively telling the library what to modify, it can maintain the AST of the original with whitespace and comments. Only the new value changes when it’s written out again, with a comment for that specific line.

                Binary config formats, like the Windows Registry, almost have to use an interface like this. It’s their one advantage over text file configs, but it doesn’t have to be. We’re just too lazy to bother.

                • bitfucker@programming.dev
                  link
                  fedilink
                  arrow-up
                  2
                  ·
                  2 hours ago

                  Ahh, then the modification must be done on the AST level not the in-memory representation since anyway you do it, you must retain the original.

    • themusicman@lemmy.world
      link
      fedilink
      arrow-up
      4
      arrow-down
      1
      ·
      1 day ago

      If you’re moving away from text formats, might as well use a proper serialisation tool like protobuf…

  • andyburke@fedia.io
    link
    fedilink
    arrow-up
    43
    arrow-down
    1
    ·
    2 days ago

    These JSON memes got me feeing like some junior dev out there is upset because they haven’t read and understood the docs.

    • RustyNova@lemmy.world
      link
      fedilink
      arrow-up
      7
      ·
      2 days ago

      If a item can have different type, those label fields are actually quite useful. So I don’t see the problem