Open Math Insert Foot

What is Sentience?

Sean Robinson — Tue, 28 Jun 2022 20:17:50 +0000

As someone involved heavily in machine learning and AI, I often get called in to answer questions related to AI in pop culture – but the recent LaMDA interview has sparked more conversation on the topic than usual.

In case you haven’t heard the news, an engineer in Google’s Responsible AI organization recently went public with his claim that the LaMDA model (Language Model for Dialogue Applications) had achieved sentience. Now, the idea of advanced models pushing boundaries until people wonder if they have a form of awareness isn’t that new – I often find myself having this conversation when new models make the news.

However, for whatever reason, this time I found myself delving more into the philosophy of the topic than the research. Perhaps it’s because there’s just no way to explore this “sentience” question without veering into philosophy. Ultimately, even with as much work as we’ve done, we are still fighting over a good definition of what sentience might look like.

Many people, when thinking about sentience or consciousness, first jump to an ad-hoc argument of “I know it when I see it.” In fact, Blake Lemoine – the Google engineer who first made the claim – used this argument himself claiming, “I know a person when I talk to it.” Most people have a conviction that there’s a difficult-to-pin-down “something” at the core of us that has a concept of itself-as-distinct-from-the-universe and which also forms the image of the self. Whenever thoughts dwell on things like sentience and consciousness, it’s that core that we all seem to keep coming back to, but the sheer meta aspect of “thinking-about-the-self-which-is-composed-of-a-thing-which-can-think-about-the-self-which-is-composed…” is daunting at best. So, a lot of our poking around into these questions boils down to “there’s a thing that we all have, and it’s hard to define, but we know it when we see it.”

Likewise, I could construct some metrics that define a kind of “sentience” but thus far the ones that have deep consensus seem to all be necessary but not sufficient. For example, I could say “a sentient being should be able to profess itself as a distinct being and discuss its actions on those terms,” and while a useful and necessary test, we’re unclear about its sufficiency.

Or I could say that a sentient being should be able to generate thoughts (or at least communication) that are meaningful to a conversation as judged by a human being, but are not simple repetition of memorized phrases. This seems necessary also, but we can clearly demonstrate systems that are capable of doing this with more simplistic, deterministic logic.

So where does that leave us? Even if we created genuine artificial life in the way we imagine, it’s possible that we wouldn’t agree on it being sentient, because we likewise don’t have a “tight” definition for what we’re looking for. This raises some sinister implications – for example, it’s generally agreed upon that humans are sentient, but without a comprehensive definition, it’s unclear if that’s defensible.

In fact, if defined more carefully, it’s possible that I could learn that according to those definitions I am functionally an automaton, whereas perhaps my neighbor does possess whatever qualities I had just enshrined. So, for better or worse, we’re trying to get at a definition that includes all the humans, excludes most machines and at least a big handful of animals, and beyond that we aren’t all that sure. How are we going to answer a deep question like the development of sentience when our rubric is basically “y’know, like us, in ways that feel emotionally right but don’t bring up too many scary questions.”

But, we’ve got an ace in the hole here. A momentary salvation. A way to keep talking without losing sleep tonight. For now.

However tough it is to tackle the philosophical question, it’s easy to kick the rhetorical can down the road and split the question into a couple of pieces. I suggest we do that – for now, I’m going to wrap the question of sentience up in a small black box and say that “effectively, sentience is the quality of making responses that are, to a human being, indistinguishable from what another human *might say* when given the same inputs.”

This is essentially just the Turing test, but splitting the question in this way lets me ask another one immediately afterward: “Can an algorithm be trained to predict (and then implicitly respond in) the way a human *would*” – i.e. can we demonstrate the full range of humanlike responses based on a well understood, but explicitly deterministic mechanism? In short, can we make a model that simulates what a person would say to pass the Turing test? Then after we settle that one we can get down to the less-well-defined-and-more-frightening “is a deterministic algorithm that responds to all inputs in an identical way to a human also a human?”

Of course, that brings up a third issue- personhood. As a friend put it, “I don’t know what makes humans people, but if an AI is a person, I want to treat it like a person, sort of regardless of how it got there.” At what point do we give something rights?

So that really makes 3 fundamentals: “can we make a thing we understand that responds in the same way as a hypothetical person would under essentially all stimuli,” “is there a clear line/structure between ‘autocomplete algorithm with a long input memory’ and ‘sentient being’ that can be enumerated”, and then “does a sentient being get to be a ‘person’ (i.e. have rights as we understand them) depending on the answers to the first two questions?”

When we look at the big neural generator networks, they mostly break down into some broad and understandable pieces. More specifically, they usually have a language encoder, some mechanism to hold memory or a secondary method to focus attention on a subset of the input, several layers of abstract, trainable neurons, and a language decoder. And this probably isn’t so surprising – this is very similar to how humans communicate. We also have a language center that seems to handle the encoding and decoding and manages the complex input and output organs we have to do the actual communication. So, is that enough – encoding, decoding, memory and attention? To answer this question, I think we have to look more closely at how the training process works.

The beating heart of modern ML techniques is the process of training algorithms, which first consists of defining a model (as above, the mathematical guts that take in a string of words and output another string) with a bunch of “parameters” (here, the contents of a bunch of matrices that are used in the model). Then we define a set of targets or objectives for this model; these are often input-output pairs, like [“the quick brown fox”, “jumped over the lazy dog”]. Then we define a “loss function”, which can be complex but basically expresses “how wrong the current model is at producing those right outputs.” Finally, we vary all those parameters (matrix elements) in the direction that most quickly minimizes the loss function, until we can’t go any farther. Hopefully, when you then put “the quick brown fox” into the input of the model, you get “jumped over…” out of the output. We then declare the model “trained” and ready to go: we can freely put more word strings in and get more outputs out.
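The recipe above (model, parameters, example pairs, loss function, nudge the parameters downhill) can be sketched on a toy problem. This is a minimal illustration, assuming a two-parameter linear model and numeric pairs instead of word strings; the names `loss`, `train`, and the learning rate are made up for this sketch and have nothing to do with how a real language model is built.

```python
def loss(w, b, pairs):
    """Mean squared error: how wrong the model w*x + b is on the example pairs."""
    return sum((w * x + b - y) ** 2 for x, y in pairs) / len(pairs)


def train(pairs, lr=0.05, steps=2000):
    """Plain gradient descent on the two parameters (w, b)."""
    w, b = 0.0, 0.0  # arbitrary starting parameters
    for _ in range(steps):
        # Gradient of the mean squared error with respect to each parameter.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in pairs) / len(pairs)
        grad_b = sum(2 * (w * x + b - y) for x, y in pairs) / len(pairs)
        # Step each parameter in the direction that lowers the loss fastest.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical input-output pairs generated by the rule y = 2x + 1.
pairs = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)]
w, b = train(pairs)
print(f"w={w:.3f}, b={b:.3f}, loss={loss(w, b, pairs):.2e}")
```

A real language model swaps the two numbers for billions of matrix elements and the squared error for a loss over predicted words, but the loop has the same shape.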

To paraphrase one of my old physics profs, “I’ve swindled you a bit here, by leaving a few things out.” The process of model definition is a whole area of study, the loss function can be super complex, training can come in multiple stages, there are a bunch of ways to minimize loss, etc, etc. But none of that complexity really changes the truth of the process above. Essentially, to “do ML” we have to define a model, define what “good” looks like in terms of a bunch of examples (or as we’ll see below, a bunch of outcomes that depend on the model operating in some environment), then shake the internal parameters of our model until it gets the best “goodness” and call it a day. And here we have what I’ll call the “fundamental” split between training environments – and possibly, depending on how you look at it, the fundamental split between Supervised ML and reinforcement learning-powered AI: The training environment.

So, what I’ve said above is a pretty cursory description of the training environment for a supervised ML approach, which is what most language models use. Ultimately stuff like GPT-3 is trained on a huge corpus of what humans have written. It’s gigantic, but you can go and get it yourself, and in principle, read it all (not that you’d especially want to). But, this is where most language models get their training – we use cut-up strings of human-written examples, and define “good” as “the ability to say the next few things in this sequence.” Alternatively, some models are trained on Q&A pairs, generally from well-researched sets. There’s a lot of caution exercised here – for example, we always reserve a set of examples that don’t go into the training process (which the algorithm doesn’t get to “see” before it’s trained) to verify that the output is still good even on things that are new to it, and ensure it hasn’t just memorized all the results somehow.
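To make the “cut-up strings” and the reserved examples concrete, here is a small sketch. It assumes whitespace-separated words and a fixed three-word context, which is far cruder than what real language models do; `make_pairs` and `split_holdout` are hypothetical helper names invented for this illustration.

```python
import random


def make_pairs(text, context_len=3):
    """Cut a text into (context words, next word) training pairs."""
    words = text.split()
    return [
        (tuple(words[i:i + context_len]), words[i + context_len])
        for i in range(len(words) - context_len)
    ]


def split_holdout(pairs, holdout_fraction=0.2, seed=0):
    """Reserve a slice of the pairs that the model never sees during training."""
    rng = random.Random(seed)
    shuffled = pairs[:]
    rng.shuffle(shuffled)
    n_holdout = max(1, int(len(shuffled) * holdout_fraction))
    return shuffled[n_holdout:], shuffled[:n_holdout]


corpus = "the quick brown fox jumped over the lazy dog"
pairs = make_pairs(corpus)
train_pairs, heldout_pairs = split_holdout(pairs)

print(pairs[0])  # first context and the word that follows it
print(len(train_pairs), "pairs for training,", len(heldout_pairs), "held out")
```

After training, performance on `heldout_pairs` is the check that the model learned something more general than the exact strings it was shown.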

And now we arrive at the heart of the issue, from a mathematical perspective. With a sufficient number of parameters (i.e. a complex enough model), you might imagine we might essentially “memorize” all the possible dynamics of speech – in essence, we would have made an autocomplete-on-steroids that isn’t so much thinking through a problem as it is just tuned to answer the “what would a human probably say next” question.

This brings us close to the “is a perfect simulacrum of a conscious being also a conscious being” question, so rather than delve back in there, I’d rather talk about the sorts of tests we would need to answer to figure out if we’ve arrived at sufficient complexity.

A modern generative algorithm is generally “fed” with a prompt – the example of the “input” text with which it then tries to guess the best corresponding output. With the GPT algorithm series, sometimes this went very well, and other times the algorithm had obvious issues with context, or would connect superficially-related things without clearly understanding that context (like if you ended a positive review of the University of Washington with “Go Dawgs!”, the algorithm might then expound on how dogs make good pets – capable of pulling from its “dog-related” set of internal dynamics, but unable to understand that “Go XYZ” is likely referring to a college mascot). At the risk of being too qualitative, it seemed to manage the “the most likely stuff surrounding this sentence has these sets of associated words” part of thought, but often struggled with the “does this actually relate to the thing we’re discussing” part. Because of this, most “amazing examples of the new algorithm” articles tend to present highly curated sets of output, showcasing the dramatic “wins” and sweeping the “confident sounding random ramblings” under the rug.

So, this adds up to one of the first necessary tests for this kind of claim: access by many researchers. Ideally, you’d be able to talk to the algorithm just as I would. We would both be able to try to stump it using examples that most humans would still understand, asking questions designed to tell the difference between actual context and local word likelihood coupled to grammatical rules, in order to see how it responds.

And what we’d expect, with that many researchers, is for a “really conscious being” to create thought, without explicit prompting. And honestly, with a background in this kind of input/output pairing as I look at the specific LaMDA conversation, I see a lot of purposeful “prompting” in the published discussion. I see the input trying to summarize and re-package the topic in a direction where the answer can just be “yes,” with a bit of a continuation.

In legal terms, this might be called “leading the witness,” and I don’t love being so harsh in my criticism, but the chief issue is that this is all we get. None of us “get to” go talk to the algorithm. We aren’t sure if, for example, what’s been published is actually a handpicked subset of what was said – or if (as frequently happens when using generators practically) the algorithm was run 10 times for each input and the researcher selected the “best” answer for each continued exchange. Essentially we don’t have a way of verifying any of this, and the field is absolutely full of examples where a researcher has been thoroughly fooled by their own creation. In short, the reason why general access is needed to answer these questions is so that we don’t get “Clever Hans’ed” over and over (https://en.wikipedia.org/wiki/Clever_Hans).

Of course, this could be seen as somewhat bleak. Am I saying that, until we all get to come and say hi to the new lifeform, we can’t say it really exists? I guess I am, and I wish I wasn’t as cynical about these outcomes. But bear in mind, the field says this sort of thing all the time. When GPT-3 came out, one of the “fun” things to do with it was to make a chatbot that couched the input language as one half of an interview, and preface “Albert Einstein” or “Marie Curie” or whoever you wanted to talk to before letting the algorithm auto-complete what that (often dead) celebrity then “replied.” And people… well, got into trouble sometimes. (https://nypost.com/…/grieving-man-uses-ai-site-to-chat…/)

Thinking about it though, even if I’m a hard sell in cases like this, I still believe we’re probably close to more general AI. The reason I say this is that the description above isn’t the only way we can train a modern AI system. That cryptic comment I made above (about “a bunch of outcomes that depend on the model operating in some environment”)? That’s the other way.

Maybe the best example to describe this is by talking about AlphaGo (https://www.deepmind.com/res…/highlighted-research/alphago). Back when AlphaGo beat Lee Sedol, it blew all our minds – because since Deep Blue and Kasparov in the 90s, we’d always been told the same story – “sure, chess was a grand challenge, but this will never ever happen with Go, because the number of potential boards is near-infinite, and a correspondingly large computer to hold them all in this way basically cannot exist.”

Well, fast forward about 20 years and it turns out they were right. An algorithm couldn’t have done that, in that way. And it didn’t. Instead, a purpose-built neural algorithm was used, in which the board (19 x 19 locations, with 3 possible things that can be on each location, so not all that big) was used as input to a deep neural net, paired with an output of the same size (with one location “lit up,” so to speak, as a choice of move). And the training question was simple – “what would a skilled human player do?” Just as discussed above, the first AlphaGo was basically a “human simulator,” trained on a gigantic number of games. But, it also proved that the overall dynamics of Go could be contained in a reasonable size – we might think of the matrices within the trained model as a kind of “encoding” of the dynamics of the game. Was this evidence that independent “thought” was happening somewhere in the encoded guts of the process? No one was completely sure, but then, someone (doubtless someone who disliked wrangling all the data and paying high cloud services bills) had an idea.
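For a feel of how small that input and output really are, here is a sketch of the encoding as described above: three possible states per location on a 19 x 19 board, and a same-sized grid with a single location “lit up” for the chosen move. The constants and function names are assumptions made for illustration; this is not the published AlphaGo input format, which I’d expect to carry more information per location.

```python
SIZE = 19
EMPTY, BLACK, WHITE = 0, 1, 2  # the 3 possible things on each location


def encode_board(board):
    """One-hot encode a board: result[row][col] is a length-3 list with a single 1."""
    return [
        [[1 if board[r][c] == state else 0 for state in (EMPTY, BLACK, WHITE)]
         for c in range(SIZE)]
        for r in range(SIZE)
    ]


def encode_move(row, col):
    """Target output: a board-sized grid with only the chosen move lit up."""
    grid = [[0] * SIZE for _ in range(SIZE)]
    grid[row][col] = 1
    return grid


# A hypothetical position with two stones, and a hypothetical reply by a skilled human.
board = [[EMPTY] * SIZE for _ in range(SIZE)]
board[3][3] = BLACK
board[15][15] = WHITE

x = encode_board(board)  # model input: 19 x 19 x 3 numbers
y = encode_move(3, 15)   # training target: 19 x 19 numbers, exactly one of them 1

print(SIZE * SIZE * 3, "input numbers,", SIZE * SIZE, "output numbers")
```

Each recorded human move then becomes one (x, y) training pair, exactly like the word-string pairs used for language models.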

“What if,” I imagine they thought, “we dispensed with the human-simulator, and all the human data, and just…let two versions play each other?” Doing this got rid of the need for all the large data, in exchange for changing the loss function (i.e. what “good performance” meant), to just winning. Not “doing what a skilled person would do,” but just winning, against another player. The training environment went from a static place where one input had basically one correct output, to a dynamic battle royale where winning behavior sets were “reinforced” over time, to gradually produce a skilled player.

And it didn’t just work. To say it “worked” is underselling the concept a little. It’s more correct to say that it became better than the best player has ever been, and it did this by rejecting its humanity. In just over a month of training without human input, data, or examples, AlphaGo “Zero” became arguably unbeatable.
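The self-play idea can be tried at home on a much smaller game. The sketch below is a toy, not AlphaGo Zero’s actual method: it assumes a stick-taking game (take 1 to 3 sticks, whoever takes the last stick wins) and a simple lookup table of move values in place of a neural net, and every name in it is invented for this example. The point is only that “winning against yourself” is enough of a training signal, with no human games anywhere in the loop.

```python
import random


def train_self_play(max_pile=12, episodes=20000, epsilon=0.2, seed=0):
    """Learn move values for the stick game purely by playing against itself."""
    rng = random.Random(seed)
    # q[pile][take] = running average outcome (+1 win, -1 loss) for the player making that move.
    q = {p: {t: 0.0 for t in range(1, min(3, p) + 1)} for p in range(1, max_pile + 1)}
    counts = {p: {t: 0 for t in q[p]} for p in q}
    for _ in range(episodes):
        pile = rng.randint(1, max_pile)
        history = []  # (pile, take) for each move; the two players alternate
        while pile > 0:
            moves = list(q[pile])
            if rng.random() < epsilon:
                take = rng.choice(moves)  # explore: try a random move
            else:
                take = max(moves, key=lambda t: q[pile][t])  # exploit: best known move
            history.append((pile, take))
            pile -= take
        # Whoever made the last move took the last stick and won.
        reward = 1.0
        for p, t in reversed(history):
            counts[p][t] += 1
            q[p][t] += (reward - q[p][t]) / counts[p][t]  # update the running average
            reward = -reward  # the previous move belonged to the other player
    return q


def best_move(q, pile):
    """The move the trained table currently rates highest for this pile size."""
    return max(q[pile], key=lambda t: q[pile][t])


q = train_self_play()
for pile in (5, 6, 7):
    print(f"pile of {pile}: take {best_move(q, pile)}")
```

Nobody tells the table the known trick for this game (leave your opponent a multiple of 4); if the training goes as intended, that behavior gets “reinforced” simply because it wins.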

Genuinely, this algorithm set amazed me as it was developed. They really are superintelligences, in the sense that they have some internal understanding that is a) beyond what any human has and b) we don’t understand. We literally managed to make something that just sailed past what we may be capable of. So far as I know, this didn’t require an internal simulation of the “self” of the algorithm, but we could imagine a situation for which that was the best/most reasonable answer.

It isn’t just purpose-built game-playing AI that pulls this “emergent behavior” trick either. Like Karoly says, “hold onto your papers” on this one: https://www.youtube.com/watch?v=GdTBqBnqhaQ

Robots can develop language this way. Communication, cooperation, deception. Other results have shown the development of things like family units, cartels, grammar, and tool use.

So, what does this kind of “emergent performance” suggest? In my opinion, the answer to consciousness (if we really want to make it) is a combination of this sort of open-ended reinforcement learning, and a sufficiently complex model and environment so as to make the development of a sense of self the best, most useful use of internal resources. Here, an “internal model” of the self and an internal model of the rest of the world, together with a sort of simulation “engine” that lives within the AI, would evolve so as to determine the best actions to take dynamically. In short, we would want to simulate an entire natural world, capable of creating actors with inputs, outputs, and things that are naturally to be avoided (like pain or death) and things like pleasure to be sought out. I believe a sufficiently complex simulated system like this, coupled to a sufficiently complex set of neurons, would produce that “sense of self as an internal algorithm driver.” So at least in my thinking, the “trick” to consciousness is not to play “guess what a human does next,” but to create an environment in which the best survival trait is “have an internal model of myself.” Then we fire it up and let gradient descent and survival pressures dictate how that internal model operates.

In plain language then, do I think that LaMDA has attained sentience? No – I think it’s an excellently trained model that does a superb job of responding to user input with realistic conversational output. It’s an algorithm that is very well tuned to respond like a human would respond given proper prompting.

But I do think that consciousness is attainable through emergent performance – and that true AI is still in our future.

Whether we’ll be ready for it when it arrives is a different question entirely.

The post What is Sentience? appeared first on Open Math Insert Foot.

The Tyranny of 3 Dimensions

Sean Robinson — Thu, 10 Oct 2019 22:47:12 +0000

I grew up fat. “Plump” might be a better term for a youngster who was athletic but also ate vast quantities of junk food. Somehow I managed to compete in gymnastics with a frame that, inflated to adult size, had around 70 extra pounds that had to be shredded off for my first bodybuilding show. Those days of excessive plumpness are now behind me, knock on wood, though it’s still a challenge, and not one I often rise to very well. Unfortunately, all that high-impact living and a bit of heritable shenanigans left me with a spinal injury – in particular, a couple of broken vertebrae. Every couple of extra non-functional pounds add up to more pain, and that means that keeping the ol’ strength-to-weight ratio high is kind of an existential need. So, all the focus on diet and the need for “mindful” eating, got me thinking about one of the aspects of food and portion sizing that gets us all in the end.

The fact that this world has 3 spatial dimensions. Yeah, you heard that right – we have 3 dimensions and people get in real trouble because of it. “How can that be,” I hear you cry, “a world with more or less than 3 dimensions just wouldn’t make sense, and besides, we’ve all grown up here and are used to it.” Well, to you I say: sit down and I’ll tell you a story.

So way back in 1993, when I didn’t have no Ph.D, I was a young boy eating Twinkies and Ho-hos, when my parents discovered a store called Costco. Yes, Costco, font of big cheap junk food, big take-and-bake pizzas, big piles of big pastries, big big burrito platters with big cheese on top and you get the idea.

Everything is big at that place, and one of the things that my parents would always bring home was pie. Gigantic pies, around twice the size and thickness of a regular pie. Bought in the name of cost efficiency, of course – those gigantic slabs of diabetes-bait didn’t even cost much back in the day. And though I’m ashamed to say it, we still ate those pies in about the time we used to eat a standard one, consuming twice the sugary, fruit-filled goodness. I got to thinking and wondering, even then, whether eating twice the pie as we had before was a sustainable activity from a health perspective. And, indeed, with twice-sized pies (and similarly twice-sized danishes, cakes, and so on), I watched the pounds pack on as I grew into adolescence, and conveniently wrote it all off as just “growing.” This left me with habits that have haunted me since, and even now staying lean-enough-to-rely-on-being-able-to-walk is a daily challenge, and not one I always succeed at. A lot of me blames those days when I accepted a factor of 2 increase in food consumption and got used to it. Except….wait…..

Have you spotted the issue with the above yet? If not, I’ll cut to the chase – the world has 3 freaking dimensions! Length, width, and…wait for it……depth! The volume of a rectangular prism is the product of these 3 things, and by the same token the volume of a cylinder (say, the approximate shape of a PIE), is (pi*T*(R^2)), T being the pie’s thickness and R its radius. So what mom and dad were conveniently missing is that when you multiply radius and depth by a factor of 2, volume goes up by a factor of 8! We weren’t eating “twice the pie,” we were eating 8Pi! Sorry, 8 pies. Technically T*R^2 times pi worth of pie, which in this case works out to 2^3 times that original pie. You know what I mean.

This was happening throughout my whole childhood. Try it out yourself, go anywhere where they serve really “big” things and take note of how much bigger those things are in a single dimension. That bread bowl looking just a little bigger than one you might imagine making yourself? Maybe just 25% bigger, you’re thinking it’s not all that different? Try 1.25^3=1.95, almost twice as much volume! Small pizza 10 inches while the large is 16, thinking they’re not so different? Even at the same thickness, this is a difference of 1.6^2=2.56. Yes, the large is about 2 and a half smalls. And an “unreasonably big” slice of lasagna that is about twice as big in each dimension as a “regular” slice racks up to an impressive 8 times the amount!
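If you’d rather not take my arithmetic on faith, the numbers above are easy to check. A quick sketch: the pie dimensions are made up (only the ratios matter), and `cylinder_volume` and `scale_factor` are just helper names for this example.

```python
import math


def cylinder_volume(radius, thickness):
    """Volume of a pie-shaped cylinder: pi * T * R^2."""
    return math.pi * thickness * radius ** 2


def scale_factor(linear_ratio, dimensions):
    """How much more food you get when `dimensions` of its lengths grow by `linear_ratio`."""
    return linear_ratio ** dimensions


# Hypothetical pie sizes in inches; the big pie doubles both radius and thickness.
regular_pie = cylinder_volume(radius=4.5, thickness=1.5)
big_pie = cylinder_volume(radius=9.0, thickness=3.0)

pie_ratio = big_pie / regular_pie         # radius and thickness doubled
bread_bowl_ratio = scale_factor(1.25, 3)  # 25% bigger in all 3 dimensions
pizza_ratio = scale_factor(16 / 10, 2)    # same thickness, so only 2 dimensions grow
lasagna_ratio = scale_factor(2, 3)        # twice as big in every dimension

for name, ratio in [("pie", pie_ratio), ("bread bowl", bread_bowl_ratio),
                    ("pizza", pizza_ratio), ("lasagna", lasagna_ratio)]:
    print(f"{name}: {ratio:.2f}x the food")
```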

If you’re still reading at this point, then you know what we have to do – it’s time to give that third dimension a second look and see if we really want it in our lives. You may think you’re happy with it, but honestly, when was the last time that depth did anything for you? With just the 2, we would all have flat stomachs, at least in a sense. I’m passing around a petition to do away with the do-nothing, useless third dimension. The petition will be on *flat* paper (go team 2D!) and you can sign it by…….by…..

Bye.

** Note: Please don’t harm yourself with eating one way or the other, everyone. I care about you. But in my case I feel like I took a solid round kick in the back every day for every couple of extra pounds I’m carrying, so I think about it all the time. YMMV.


They took our jerbs!

Sean Robinson — Tue, 09 Jan 2018 03:53:03 +0000

The replacement of “raw” manual labor or repetitive mental tasks with robots is definitely upon us. This has to be the single most talked-about facet of the AI revolution, and I’d definitely agree that this level of job displacement (at least) is in our future.

One thing that doesn’t get discussed as often is the suppliers and training infrastructure for the new robots as we move into a post-labor economy. I think the common wisdom is that individual companies will produce and engineer their own solutions, so we expect McDonald’s to roll out their own line of automated ordering kiosks, and Amazon to engineer and train their own fleet of warehouse bots (note the shelves designed to interface with these bots, all the same size? We’ll come back to that). But this isn’t really the way it happens: new companies generally spring up to design, train and help businesses customize solutions. For example, companies like Zivelo make off-the-shelf, customizable physical hardware to do signage, ordering and other tasks. Likewise, Amazon acquired the Massachusetts-based Kiva Systems to produce their in-house robotics. So there is at least one market that will thrive in the years to come – producing generally-applicable, multi-use robotics and then training them to the specific needs of a client. But how will this come to pass?

In years gone by, robots were far more well-defined. Purpose built automatons usually did exactly one function, and hopefully had some sensing systems to decide when to do it and whether they had been successful each time. As with the robots used to assemble cars, the early models would perform one very specific task – say, welding two parts together for a specific model of car. If the welder needed to make a different model, or the parts were slightly different the next year, the line would have to be shut down until new patterns could be input. The need for more flexibility gave rise to robotic automation processes (much of this work happened in the 90s). In this new approach, multiple cars might come down the assembly line and the robot would need to sense which model was in front of it, then do the correct task at the correct time. This allowed a lot of efficiency benefits to production, but still was more procedural and specific than the AI we think of today, and the robots were still generally single-task devices (like an arm for putting bolts into holes) that could merely do their job in a variety of patterns.

Even this context-dependent thinking couldn’t stretch “old AI” to new-world requirements. In fact, the world of machinery-production managed to hit a snag with this “single-job, limited-context” robotic picture just a little while ago, as the immense degree of customization on some car lines exceeded the ability of 2000-era robots to adapt. Several car manufacturers produce vehicles sufficiently variable on an individual level (from different hubcaps to specific drive train elements) that the current robots can’t really do the right thing every time – or training them to do so with “if this, then do that” logic would be prohibitive and likely buggy. To adapt to this limitation, other manufacturers have started to use robots in “helper” positions, managing things that would be hard for a person to do and low risk, like holding a part in place for a human to attach. This represents a move forward to “multi-job, human-defined context” robots, and one could imagine just a few humans commanding and overseeing a fleet of such things, providing context and correction where needed and likewise giving the feedback needed for the robots to continually optimize their behavior.

So what does all this have to do with business opportunity today? Well, with all this innovation and development, not a lot of attention has been given to smaller scale automation. It’s my firm belief that 25 years from now, there will be robotic servers at small restaurants, local pizza delivery will happen via drones, and if you go to the hospital, there will be a semi-autonomous robot that checks your blood pressure and brings you meds. But, the users of those technologies won’t be able to absorb a whole robotics company to turn towards their needs, and they won’t be able to employ a robotics specialist to write custom algorithms or change their entire business infrastructure to revolve around the robots either. They’ll need totally general, Lieutenant-Commander-Data (or at least Wall-E) style thinking machines to handle specific but open-ended task lists with complex and changing context. And here’s the kicker – none of that is quite ready to happen yet. It isn’t just the difficulty of making general-task hardware, though that also presents many engineering challenges. It’s also the training of the existing algorithms that presents the challenges. The earliest robots operated with only the simplest context – “if there is a part in front of me, carry out my one task.” This was expanded into “if part, sense one of a few enumerated contexts, and choose which of my few tasks to do.” And with this latest revolution, we’ve managed “act in such a way so as to accomplish several predefined things, e.g. do not hit humans, do not break anything, and accomplish goals I’ve been given like moving a part into a place or getting coffee into a cup without spilling it”. This is a huge step forward, but it still doesn’t get us to “figure out what a human would want in this general scenario and do that thing.” That’s at the very edge of reinforcement learning research, and could be the subject of at least a whole article.

To sum up, the biggest players in their respective industries are about to get a powerful advantage with robotics. As mentioned before, Amazon can afford to purchase a whole company just to make robots for their specific purposes. It can afford to alter its logistics chain and make new physical factories and distribution centers to revolve around these new robots, with package storage shelves that work in tandem. In short, it can produce a top-down solution that only really works for itself and would have to change a lot for any other user. But if I run, say, a florist shop and I want to have a robot helper that chooses flowers, cuts stems, arranges them and puts them gently in a package for shipment, then I have problems. It’s unlikely that anyone will have a Boston Dynamics-style automaton that comes preloaded with an “arrange flowers” routine, and even if they did I’d still somehow need the robot to know when to get what flowers. A custom robot-solution would perhaps tie the ordering system to the robot, but again, the florist likely doesn’t have the infrastructure or technical knowledge to do this. So either someone needs to make a company that specializes in making and training “helper routines” to form the brain of your new GeneralBot3000, or someone needs to make an AI sufficiently general that it can watch a human doing their job and “pick up” what happens when and how contexts work.

In my estimation, advancements will happen in that same order: first GeneralBot will come with a small selection of brains and you’ll order the one(s) you want for your set of tasks, and then eventually a truly “general context” mind will be created that will allow watching and then replicating human activities, with knowledge of the surrounding context. So we’re gonna start with “find dirty dishes in this building, wash them and put them into a pre-designated configuration” and end with “watch your owner get out painting materials, paint part of the house and then listen as they say the whole house needs to be painted every few years.” And I believe that creating, training and selling pre-made brains will be a big deal starting around 2020 and thereafter. But long before that, the Amazons, Toyotas and even McDonald’s of the world will have fully workable systems, made possible by their ability to limit context rather than train for the variety. They’ll be able to operate with a lot less labor overhead, whereas others will need to wait for smarter robots to handle their high-complexity environment. Bleak? I’m honestly not sure. But I am very sure that we’ll see a lot of jobs changing in the near future, and displacement will be a real issue, especially with large employers.

The post They took our jerbs! appeared first on Open Math Insert Foot.

Science and Government Control

Sean Robinson — Wed, 06 Sep 2017 16:47:56 +0000

Come closer. What I’m about to tell you is privileged information. No seriously. Closer still. *conspiratorial whisper* The government controls the science you get to know about.

Is this a surprise to anyone? It could be. Right now my country (that’s the USA) is going through some……concerning times with regard to research and science. I say “concerning” because almost no matter what your personal politics, you probably have the same thought on a regular basis: “The government is censoring the science that I like – pulling its funding, never letting the results see the light of day, and generally behaving like a great big….. Big….. Brother. And stuff.” So is it true? Regardless of the science you “like” and would like to see pursued, can the government just shut down and deny science? And the answer is…..well, yeah. Kinda. I’m getting a little ahead of myself though, so let me explain with a personal example. I may be changing a few names around to protect the….me. But apart from that, this is what happened:

I’m the author of a banned book.

Well, okay, I’m the co-author of a banned book.

Most technically, I’m the co-author of a banned textbook.

Okay, put down the pitchforks and angry tweets for a moment and let me explain. I didn’t have the audacity to try to teach evolution to Texas, nor am I the leader of a communist cell inside a liberal arts college. I’m a plain old computational physicist. And around a decade ago I was a plain old computational physicist working at one of our nation’s wonderful Federally Funded Research and Development Centers (FFRDC’s) (https://en.wikipedia.org/wiki/Federally_funded_research_and_development_centers). Whether you know it or not, you love these places. These are places like Los Alamos National Laboratory and the Stanford Linear Accelerator Center. Big names like The Jet Propulsion Lab and Lawrence Livermore. Dorks in lab coats. The Manhattan Project. Reactors! Killer viruses! This is where the magic happens!

And part of that “magic” comes from those first two “F’s” at the beginning. Federally Funded. We couldn’t keep the lights on without someone bankrolling all the flying saucers. And it sure sounds like someone providing the funding gets to call the shots, right? But then… the government of the USA is designed to serve the people, and science is pretty clearly an apolitical affair, delving into the unknown and refining hypotheses into strong theories that can drive engineering efforts and bring us all a better life. Isn’t it?

Well….generally yeah, it is. So how does a banned book happen in this wonderful “apolitical” world? It all starts with the funding and publication cycle. You see, when you do government-funded research, you tend to be one of two sorts of people (well, three if you count small businesses). The first are university professors – these tireless people work around the clock for relatively small salaries and have angry students pounding on their door at every hour of the day. The real upside is that at some point you get “tenure” and can then go around flashing your intellectual, emotional, and in some cases physical junk at everyone and there’s nothing much anyone can do about it. Yee-haw. Oh, and then there’s some weirdos that just love teaching the next generation of scientists. Either way, these folks have a saying, “publish or perish.” To show that you’re a valuable asset means publishing research papers, and to get the results for those papers you need to do research, which in turn means groveling for funding, generally to the government. On the other hand, when you get this funding (and you’re an academic), you usually get it in the form of a grant, meaning that it’s….well, granted to you. Technically, you don’t have to produce results with it. You don’t have to make papers from those results, you don’t have to do much of anything but cash the check and pay yourself. A lab full of beer and hot tubs is optional, but not recommended if you want to keep having a career. But in the same breath, there’s little they can do to stop you from doing whatever you want, including publication.

That sounds good, science as it should be, right? With the science-funder getting what they paid for: truly unbiased results, free of anyone’s finger on the scales of truth, free of anyone’s entire arm using researchers as sock-puppets to bless their baseless convictions. But then….what if you produce results that the funding agency doesn’t like? They could just not fund your research anymore if they don’t like it or the results you expect. So here we have the first and hardest to measure “lever of control” the funders have – they could, and do, just stop funding things they don’t like the sound of.

But wait – remember I said 2 groups of people, and promised tales of a banned book? Well, the other folks are those at FFRDC’s – rather than grants, people at your national laboratories work on contracts, and those come with all sorts of stipulations on the money. In this case, the relevant one is the right of the funding agency to review anything prior to release. While this has important positive reasons (not wanting to reveal things that could be detrimental to public safety or national security, for example), it also allows an effective “science veto” that government offices have after results come out. Under contracts, the funding agency didn’t just pay for the results, they completely own them, and can deny publication.

In my case though, the book in question didn’t contain nationally sensitive or potentially controversial material. A well-known colleague in the field (let’s call him Buford Stevenson, not his real name) got a bunch of us together and each of us wrote a chapter for a textbook (let’s call it “Toward A Better Optimization Order”). Each of us worked off and on for a couple of months, gradually making good and informative chapters about the subject. Or so I can say with impunity, now that you’ll never be able to read it.

As is usually done before anything gets published, BS called the funding agency that had been handling most of our work (let’s call them the Internal Reviewers of American Technical Enterprise) and asked for an okay to publish. He got a verbal “yes,” but from someone who, unbeknownst to BS, would neglect to write it down anywhere and would then leave IRATE a few months later. So we went ahead, codifying the chapters into a book, finding a publisher, an ISBN, getting a first printing, shipping to bookstores, the whole nine yards. I was proud to find a first-print hardcopy in my office one Friday, with a note of congratulations as one of the co-authors.

And very surprised indeed to get a less-congratulatory note the following Monday demanding that every copy be returned to be destroyed. See, we’d dotted all the I’s with TABOO, and crossed almost every T. Nothing in it was classified, and everything was in keeping with IRATE’s stated mission. But… sometime soon after publication, someone called up the head of IRATE to congratulate them on funding a successful book. And the response was “……what book?” I imagine his head kind of….spinning around and steam shooting out of his ears with one of those train-whistle sounds at this point, but I wasn’t there.

We had violated their right of review and refusal, and this was unthinkable. So, IRATE called BS. They called my organization. They called everyone and proceeded to yell until we recalled everything. Every copy sold to a bookstore was bought back, every first print was taken back, every copy that had managed to leak onto the internet or eBay within the first week was bought and reclaimed regardless of cost. For all I know they burned them in a ceremonial bonfire. Or maybe they’re stored next to the Ark of the Covenant in some infinite government warehouse. Either way, it was technically IRATE’s right to do that, as well as their right to stop all publication of any kind for years thereafter – and they did just that. Money went in, but public science didn’t come out – results were to be sent only to them and used for their purposes. Don’t worry, it wasn’t exactly anti-gravity we were working on, you probably aren’t missing out on having a working hoverboard because the TABOO book got banned. But then again, that was one small example, and it didn’t even require true politics, just one bad miscommunication. So for all we know, that hoverboard might be spinning around in an empty bunker somewhere, never to see the light of day…

If you’ve read this far, well, first of all thank you. But second, this isn’t intended to be depressing or an indictment of the way government politics operate. It’s intended to show the potential power of a system gone awry, and make it clear that we have a responsibility to keep demanding scientific knowledge and results. Does the government control the science you get to know about? In many ways, yes. Does it abuse that power and keep results from you, or actively shut down research it doesn’t want to hear about? The answer is…..I hope not. I want to believe that on average, science is funded because it’s important to the whole species, and the results are made available as much as possible for those same reasons. Is that a pipe dream? I honestly don’t think so. In most ways, the folks funding science understand they have a mandate to look in all the corners and follow everything that looks like it holds answers, even where those answers might be uncomfortable.

But if you want it to stay that way, then be aware of the power they do hold, and the ways that power could be used against good science if no one says anything. So know, and say something. And thanks.

The post Science and Government Control appeared first on Open Math Insert Foot.

The Monte-Carlo Hammer – Turning Dynamics into Observables

Sean Robinson — Wed, 06 Sep 2017 15:43:07 +0000

“If all you have is a hammer…” so the old adage goes. As it turns out, I have a nice big one, and a corresponding anvil. One is called Markov Chain Monte Carlo (MCMC), and the other is called the method of Maximum Likelihood. I’m going to talk about the first one today, then later on I’ll explain how the two can come together to form ~~voltron~~ a generalized method to estimate almost anything.

Monte Carlo modeling is actually really simple when you think about it. Often, you’d like to know the result of some process or experiment where a general or analytical solution is really hard. Let’s say, for example, you want to know what will happen if you add an extra road to a city, or change the configuration of some intersections. Let’s make it interesting – say we have control over the light timing of all the intersections in a city, and we’d like to know how to increase traffic throughput, making the roads more efficient.

So, one way to do this is to try to model the traffic as a fluid, or somesuch, and attempt to make all sorts of assumptions about the average traffic density and how it affects the way that fluid flows. On the other hand, this is likely to lead to you assuming or approximating a whole bunch of totally unknown dynamics – you’re essentially trying to guess what the emergent behavior of traffic will be before you’ve really understood it. So, a much ~~easier for the ignorant~~ more elegant approach is to try to model a single car and describe its behavior with a set of parameters that, while variable, nonetheless reference better understood behaviors. So things like the top speed this car will go if uninterrupted, the set of places it may be trying to get, how close it is willing to be to another car, and how likely it is to switch lanes if the next lane is moving faster, can all be quantified. You don’t necessarily know the real values at this stage, of course, but you write them in as variables. We call these the Input Parameters, and they might include other things of interest, like which day of the week is being modeled or whether it is raining, which in turn might set other internal variables.
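To make that concrete, here is a minimal sketch of what writing those Input Parameters down might look like in Python. Every name and default value below (`CarParameters`, `ScenarioParameters`, the rain rule in `effective_car`) is made up for illustration; none of it comes from a real traffic study.

```python
from dataclasses import dataclass


@dataclass
class CarParameters:
    # Per-car behaviours named in the text; the defaults are placeholders, not measurements.
    top_speed_kmh: float = 50.0                       # speed if uninterrupted
    destinations: tuple = ("work", "school", "shops")  # places the car may be heading
    min_gap_m: float = 5.0                            # how close it will get to another car
    lane_switch_prob: float = 0.2                     # chance of switching to a faster lane


@dataclass
class ScenarioParameters:
    # "World" inputs that can set other internal variables.
    day_of_week: str = "Monday"
    raining: bool = False


def effective_car(car: CarParameters, scenario: ScenarioParameters) -> CarParameters:
    """Derive internal values from the scenario.

    Assumed rule, purely illustrative: rain means slower driving and larger gaps.
    """
    if not scenario.raining:
        return car
    return CarParameters(
        top_speed_kmh=car.top_speed_kmh * 0.8,
        destinations=car.destinations,
        min_gap_m=car.min_gap_m * 1.5,
        lane_switch_prob=car.lane_switch_prob,
    )


if __name__ == "__main__":
    print(effective_car(CarParameters(), ScenarioParameters(raining=True)))
```

The point is only that each knob is a named variable you can change later, even if you have no idea yet what its real value should be.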

Then, you use all these parameters as input into a simulation. Here’s where the “Monte Carlo” part comes in. See, “Monte Carlo” is several things: A casino in Monaco, the region surrounding it, and a technique using random chance to generate synthetic data sets or histories. Don’t panic – Monte Carlo analysis is easier than learning to play solid poker and you’ll probably make more money because of it. Basically to do it we begin by simulating one experiment – in this case, simulating one “day of traffic.” To do this we can use our favorite code base – we use all of the Input Parameters and generate a set of hypothetical cars. Let’s say they begin in random places and proceed according to the rules we set out above. “But where does the gambling come in?” I hear you cry. Well, in two places: First of all, my hypothetical traffic has a lot of random elements in it. Where the cars start out, how quickly one driver might step on the gas, or where accidents happen that day all have randomness contained in them. So to simulate the entire “experiment” and come out with the whole behavior of traffic, we will need to roll some dice. In the field of high-energy astrophysics, where I first cut my teeth on this technique long ago, the randomness was all about the angle from which a photon from space entered the telescope and where it interacted with the materials within.
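Here is a rough sketch of one such simulated experiment, assuming a deliberately tiny "city" that consists of a single traffic light. `InputParameters`, `simulate_day` and all of the numbers are hypothetical stand-ins, and the "day" is shortened to one hour to keep things quick. The dice rolls are the calls to `rng.random()`.

```python
import random
from dataclasses import dataclass


@dataclass
class InputParameters:
    # Hypothetical knobs for a toy one-intersection city; not real traffic data.
    arrival_prob: float = 0.3   # chance that a car arrives in any given second
    green_seconds: int = 30     # length of the green phase
    red_seconds: int = 30       # length of the red phase
    seconds_per_car: int = 2    # time one queued car needs to clear the light
    raining: bool = False       # assumed rule: rain doubles the clearing time


def simulate_day(params: InputParameters, rng: random.Random, seconds: int = 3600) -> int:
    """Simulate one 'day' of traffic and return how many cars got through the light."""
    clear_time = params.seconds_per_car * (2 if params.raining else 1)
    cycle = params.green_seconds + params.red_seconds
    queue = 0        # cars currently waiting at the light
    passed = 0       # the observable we care about
    busy_until = 0   # the intersection is occupied until this second
    for t in range(seconds):
        # Dice roll: does a new car show up this second?
        if rng.random() < params.arrival_prob:
            queue += 1
        is_green = (t % cycle) < params.green_seconds
        if is_green and queue > 0 and t >= busy_until:
            queue -= 1
            passed += 1
            busy_until = t + clear_time
    return passed


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so the run is repeatable
    print("cars through the light:", simulate_day(InputParameters(), rng))
```

Run it with a different seed and you get a different count, which is exactly why a single run is not yet an answer.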

Okay, so you’ve input your parameters defining the world and cooked up a simulation that turned that into one hypothetical experiment (in this case, looking at a day of traffic and seeing how many cars went where). Sounds like a result, right? Well, the careful reader will notice that I said the “gambling” happened in two places. You see, in a choice between being a player and a casino, you’d always choose to be the casino. This isn’t just because the odds are slightly in their favor, but also because they play thousands of “games” an hour, such that their winnings are basically averaged out. Even with a slight loss in odds, as a player you could win or lose, but as the house, you basically always win over time due to the law of large numbers. Explaining exactly why may be the work of another post, but for now, suffice it to say that when you run a Monte Carlo simulation, you also want to be the “house” and do many simulations to “average out” your result. In general, this usually means picking some observable quantity (e.g. the number of cars passing a particular point in an hour) and then running many simulations to find a good mean. And as they often say, a good mean is hard to find.
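A small sketch of "being the house" might look like the following. The one-line `simulate_day` here is a toy stand-in for a full traffic simulation (each of 1000 hypothetical cars passes the checkpoint with an assumed probability of 0.3), and `monte_carlo_mean` is a name invented for this example. What matters is the pattern: repeat the experiment many times, then report the mean of the observable along with its standard error.

```python
import random
import statistics


def simulate_day(rng: random.Random, n_cars: int = 1000, pass_prob: float = 0.3) -> int:
    """Toy stand-in for one full simulation: count the cars that pass the checkpoint."""
    return sum(1 for _ in range(n_cars) if rng.random() < pass_prob)


def monte_carlo_mean(n_runs: int, seed: int = 0):
    """Run the experiment n_runs times; return (mean, standard error of the mean)."""
    rng = random.Random(seed)
    results = [simulate_day(rng) for _ in range(n_runs)]
    mean = statistics.mean(results)
    # The spread of individual runs stays the same, but the uncertainty
    # on the mean shrinks like 1 / sqrt(n_runs).
    std_err = statistics.stdev(results) / n_runs ** 0.5
    return mean, std_err


if __name__ == "__main__":
    for n in (10, 100, 1000):
        mean, err = monte_carlo_mean(n)
        print(f"{n:5d} runs: mean = {mean:.1f} +/- {err:.2f}")
```

Going from 10 runs to 1000 should shrink the error bar by roughly a factor of ten, which is the casino's advantage in miniature.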

So here we have it – input parameters go in, random “world simulation” gets driven by them and iterated many times, nice average values come out. And we didn’t need to know beforehand how the whole system worked, just individual bits. But how will we know that our input parameters were right? Or, more to the point, what if those parameters were the very thing we wanted to know in the first place? Well, the answer comes when you bring the hammer called “Monte Carlo” together with the anvil called “Maximum likelihood” and pound out some real-world estimates. Stay tuned for more about that in an upcoming post.

The post The Monte-Carlo Hammer – Turning Dynamics into Observables appeared first on Open Math Insert Foot.

Consent Horizon: How the EULApocalypse is coming to assume us all

Sean Robinson — Wed, 06 Sep 2017 15:40:19 +0000

I was recently standing in a department store, getting a new waistcoat to further cement the “professor gym-rat” look I have carefully constructed. Next month I’m gonna try suspenders and a bow-tie.

Anyway, during the checkout process, some statement flashed up on a screen to the effect that I needed to know and agree to some terms. It was probably something trivial like the requirement to anoint the vest with sheep’s blood once under a full moon before wearing, or maybe just the return policy. I’ll never actually know, because at the moment it flashed up there, the woman on the other side of the counter reached around the POS terminal and pressed the “agree” button for me. My first reaction to this was “hey, I was reading that!”, followed immediately by “oh, wait, no I wasn’t. I was about to do exactly what you did without a second thought.”

I had just become another statistic in the ever-increasing rights-erosion that is happening, more or less ironically, with our full “consent.” The salesperson could be perfectly sure that I was about to agree to whatever was on that screen – be it a signed confession of murder, or just an agreement not to sue if my clothes came to life and ate the cat*. But to make the case clear for why this phenomenon isn’t going away any time soon, let me go back in time a few decades….

In the late 90’s, consumer culture seemed to have reached a kind of disclaimer-zenith. So many products had disclaimers of use, and they were so intensely worded, that at the time I predicted that every product by 2010 would come with a simple label saying “for novelty only, not to be used for any purpose at all, under any circumstances.” Sounds nice and air-tight. But as it so happened, there was a problem with this reasoning. While it’s all well and good to say “if this product poisons you, you can’t sue us,” it isn’t very enforceable in a world where people don’t expect to be poisoned. How much better, then, to literally say to a person “hey, this could be poison, you’ll have to agree that you understand that before you buy it.” But how to make that fly in a society full of people who still don’t want to be poisoned? Well…..

First of all, we don’t read much. Really. People get a very little way into any content (specifically online content) before tuning out, so a good technique to getting your poison-laced whatever through into legal-land would be to just fill the thing with legalese and make it very, very long. According to some careful research, the average EULA would take the average person nearly 40 minutes to finish, far longer than it will take you to read this article. So, check.

“But wait,” I hear you cry, “these things are important. You could put the whole Principia Discordia in one of these things and then I’d have signed it without knowing – of course I’ll read it!” Well, unfortunately, no you won’t. Meet Vigilance Fatigue, the pattern of gradually eroding mental capabilities you face when dealing with this sort of “complex information overload.” According to the FBI:

The final class of threats to sustained vigilance resides in the nature of tasks and the quality and quantity of involved data. Information overload becomes a critical contributor to vigilance fatigue by creating data and cognitive clutter, fostering workload bottlenecks, and impacting an individual’s ability to detect significance in data. Overload-related issues are complicated further in cases where the data is novel, ambiguous, complex, or conflicting. These circumstances undermine the utility of tried-and-true cognitive schema, expert systems, heuristics, and synthesis strategies and reduce decision-making certainty and confidence.

So even if you want to read that 35-minute EULA, somewhere in the middle your eyes will cross, your vision will blur, and you’ll (at least) need to go back and try to summarize. And from the labels-on-everything in the 80’s to “exclusion of all uses” in the 90’s and finally super-EULAs in the noughties, we have, as a society, given up.

It didn’t seem so bad in the 80’s….

So what happens to the companies who jumped on the “mentally-fatiguingly-long-EULA” bandwagon? Well, among other things, enhanced legal protection for whatever policies they wanted to enact. But what happens now – will people stop buying these products? And if not, what does it mean for the market? Well, buckle up, because we’re about to find out. See, the demand for products tends to revolve around the cost and the willingness of people to pay that price. For the sake of argument, here let’s assume that the monetary (i.e. dollar) cost is fixed, and let’s instead focus on the cost to privacy and rights. We’ve established that companies get some tangible benefit to adding more “consent” and “assumed consent” clauses onto their products. If people are very responsive to that additional cost, then we have the graph on the left, where a little more cost added means a lot of people don’t want the product as much. We’ll call this behavior “the 70’s,” in a shameless appeal to the golden age, and note that it allowed companies to only impose a little of this “cost”. But if people don’t “feel” the extra cost very much, we get the graph on the right, where “prices” can rise quite a lot without risking people dropping out of the market.

Inelastic Demand Means bigger “price” hikes until….

So the market raises prices. A lot. And lest you think we’ve neared the end, extrapolate a little with me – as the demand line turns ever more vertical, that increase in “price” by way of giving up rights and knowledge can go up faster and faster, rather than reaching a set maximum. In effect, if the “demand for rights/privacy/actually informed consent” goes vertical, then we will hit a “Consent Horizon,” in which essentially all rights are forfeit upon encountering the product in any way: the moment when all liability is assumed by the buyer upon even the intention to buy anything.
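If you want to see that runaway in numbers, here is a back-of-the-envelope sketch. It assumes a constant-elasticity demand curve, Q = Q0 * (P / P0) ** (-elasticity), which is a standard textbook simplification and my own choice for this example rather than anything measured. The function `price_multiplier` is a made-up name; it asks how far the "price" can rise before only 90% of buyers remain.

```python
def price_multiplier(elasticity: float, buyers_kept: float = 0.9) -> float:
    """How far the 'price' can rise before only `buyers_kept` of the buyers remain.

    Assumes constant-elasticity demand: Q = Q0 * (P / P0) ** (-elasticity).
    Solving Q / Q0 = buyers_kept for P / P0 gives buyers_kept ** (-1 / elasticity).
    """
    if elasticity <= 0:
        raise ValueError("elasticity must be positive")
    if not 0 < buyers_kept <= 1:
        raise ValueError("buyers_kept must be in (0, 1]")
    return buyers_kept ** (-1.0 / elasticity)


if __name__ == "__main__":
    # Smaller elasticity means a more vertical demand line.
    for e in (2.0, 1.0, 0.5, 0.1, 0.01):
        print(f"elasticity {e:>4}: price can rise {price_multiplier(e):,.2f}x")
```

As the elasticity heads toward zero the multiplier grows without bound, which is the "Consent Horizon" in this toy picture: there is no longer any "price" high enough to drive buyers away.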

So the lady at the mall had it right, by just pushing the button for me. And what had that screen actually said? I dunno, I wasn’t really reading it…..

* I do not have a cat and no animals were harmed by ravenous animated clothing during this story.

The post Consent Horizon: How the EULApocalypse is coming to assume us all appeared first on Open Math Insert Foot.

The “Telescope Effect” in brainstorming

Sean Robinson — Wed, 06 Sep 2017 14:48:02 +0000

In my time working as a government scientist, I was able to work with a lot of cross-disciplinary teams, often putting math-and-code-heavy folks like myself together with domain-specific specialists like chemists or biologists who worked in wet labs more and made numerical models a bit less. I found the power of brainstorming is much magnified when working with folks of diverse perspective and skillset, because of what I’ve come to call the “telescope” effect. If a researcher isn’t careful, as their knowledge of a subject gets deeper, it also gets more specific, and ultimately their day-to-day thoughts and feelings about the applicability of their field may become narrower. As a result, many of us are carrying around really great concepts (say, a novel way to analyze data) but never thinking about the other problems that those concepts could be applied to, because our individual focus has become like a telescope – fixed crisply on a tiny region of the overall concept space. Often all that is needed is a person with a different focus to let that telescope re-point at the new problem, and all of a sudden the power of the technique can come to bear. This is one of the reasons I’ve sought out problems that reach between disciplines.

TL;DR, a lot of the great new ideas aren’t really all that new; they’re well-established ideas that have been working well on different problems and will work just as well on yours. But often those ideas are locked away in little domain-specific boxes. Go find people with very different backgrounds and experiences and open some of them up and trade.

The post The “Telescope Effect” in brainstorming appeared first on Open Math Insert Foot.

About This Blog – Doom from the Machines

Sean Robinson — Tue, 27 Jun 2017 04:27:24 +0000

This entry may change over time, as the needs and purpose of my readers vary, but the core will remain the same.

I find myself wondering what technology must have “looked like” to folks just entering the industrial revolution. Strange engines running on magic and science, allowing one human being to do the work of 10 or 100? That sometimes detonated or otherwise disassembled, maiming or killing folks? The threat of all of the jobs we knew going to these new things? A society where the richest people could buy them and dominate the production industry? A world where knowledge of arcane engineering was the “new ticket” to riches while old industries crumbled to dust, sacrifices on the altar of progress?

But then…. the promise of more shoes made in a day by one person than ten could make in a week before. The ability to harvest more, plant more, and generate more food for the same labor. The promise of more knowledge, more communication, more time for learning and more dissemination of knowledge. A better world for everyone. Could both these visions be simultaneously true?

One thing I’d lay money on is that everyone felt the need for more knowledge. Whether you felt machine-fueled doom upon your livelihood or saw the dawn of a new era of prosperity, the world you’d have lived in was increasingly dominated by arcane knowledge – in particular, the knowledge of the forces of production. Today, one of these great forces is the analysis of data and production of numerical models. Take a look at the list of things my own professional career has touched on, and try to tell me it doesn’t sound like a list of desired skills for a “court wizard” in the middle ages:

Prediction of weather patterns
Production of automatons for manual labor
Determination of the secret motivations of potential opponents
Predicting the outcome of one choice vs. another on the part of a decision maker
Detecting dangerous material or weapons even when hidden

Provide 2 professional references, must bring own wand, position open until next vernal equinox. Only now we’re called Scientists. And a lot of us were dug out of labs, professor positions and other gigs to answer the data-specific needs of this new age. “The more things change,” I guess.

This is getting long, so I’ll wrap it up for now. If there’s one fundamental I keep coming back to, it’s that knowledge is for everyone – and I mean everyone. And with the stakes raised sky-high in the new world, you need it. But chances are, you haven’t spent your whole life marinating in math books and code, and have little interest in slogging through piles of experimental methodology sections of papers to get to the heart of the matter. So I’m going to employ another bedrock principle I have, which is everything can be easy. So we’re going to take experimental results, contentious topics, and complex analyses and make them easy, understandable without leaving important things out. I’m not going to insult your intelligence by suggesting you don’t care about the details. The fact that you’re here means you probably do. So that’s the goal – all the specifics you want, starting from “no knowledge of the field” and building up how it all works. Explanation from soup to nuts, for your consideration and use. Hope it works!

The post About This Blog – Doom from the Machines appeared first on Open Math Insert Foot.