Add to Watchlist

Lies, damned lies and scans

12 views

Citation of segment
Embed Code
Purchasing a DVD Cite video

Automated Media Analysis

Beta
Recognized Entities
Speech transcript
so I know everybody nice to be here I hope everybody can hear me so if not just tell me something and we can do the mike alright I'm that and computer scientists from 1 so actually pretty close by this conference I come from in real life I work and you
on topics like Release Engineering engine machine learning and data science but this presentation is about something completely different you know when he was at I. T. security related presentations and afterwards there's there's always some kinds of devices you don't really like to use any more so a little disclaimer if somebody has a special relationship with this copy machine so this presentation is not really for you In this presentation we will do 3 things 1st we will look at 1 of the most common and most dangerous facts in the last years 2nd we try to make it plausible for techies and from non-techies and 3rd and I believe that such a story might happen to just everyone here so we'll look at how a single person can deal with going against the becomes so we will be talking about how this developed and which mistakes I made and you would derive take-home messages for you to rely upon in case you face such a situation are the presentation is a little bit like a novel so and as such it starts with the PLO approval for the conspires the theorists among you in the year 2008 In
2008 there's no
job but people are laughing already here the primaries took place in the USA with Obama against Hillary Clinton and is every everywhere else politics in the US a comedy of intrigue and so that was anonymous yields that that should be useful to delivery it was held that Obama was born in Kenya a Kenyan citizen and that's not fit for being a president because he needs to be a natural born citizen of the United States when the term natural born citizen is actually not properly defined as the Americans are not really sure from the form of for themselves but there's a consensus need to be American and you need to have been American at birth and you can imagine that about 2nd name seen was also not that optimal in that context so then to have it done Obama published his short from birth certificate depicted on the left and if you're a good conspire see the arrest and not going to be dispelled by effects and like immediately there were claims that those of us certificate was false like a standard was not on the right place and stuff like that was held no
no on the right is the sum bumper sticker the 1 explicitly asking for the birth certificate so even though Obama won the primaries and the presidential election 2008 that followed the controversy did not really stopped the theory about my doesn't fit to be president surprisingly unpopular in the United States business named the above the movement and they want to prove about that is not a real American so this is actually
true so 2 and a half years later when Obama was already President the discussion still wasn't always so Obama was was really pissed off you can imagine about not published the long-form birth certificate shown on the right known so as you can see there's more information so you could say no it would be quiet but again there were claims that the birth certificate would be would be faults so let's look at that that that in more detail the
left image null is a surname I this is the strong magnification of the place mark correct in the right image that between that between the different letters between the 1 and the 4th for example a conceit and there is a clear differences in gradient and sharpness the 6 and the floor a perfectly pixel at shock and uniformly colored 1 the 1 is slightly and shot and its color has some noise like you would expect from this connection with so how is that there is such a difference in 1 and the same line In this
additional example you also see similar differences between the
2 boxes mark so again 1 like in that when is like you would expect it from the scan and the other 1 is shot on the pixel level like somebody drew it with Microsoft paint
all at this is pure roll of this
part of the images talk taken from the stem and it looks like there was a typo in the system yeah sure right now know how likely is that so of course people believe the birth certificate was forged when they see things like this and additionally 1 believes and the White House and furnaces produce fortified social costs for a
ban on so that was a major PR fucked up adjusted to see this thing is huge and in and CNN poll of 2010 at least 11 per cent of the US public believe Obama wasn't born in the US state and an additional 16 per cent that he probably was born in the US so this is a quarter of the US general public and even today the White House gets request for comments and now we jump to 2013
the 20 4th of July 2013 the company called me they have to be external extract axelrod's work centers those are huge business copying machines that like every company has no they've network that can scan print and fax and e-mail and everything
and because as much as the car and that's not given printer but they may have hundreds and hundreds of users by machine and on the slide you can see a blueprint and the black lists are not originally and let them not because I couldn't have used this group and otherwise and there are no 3
general places the yellow places are standardized box which show the area of the room and square meters we talk about those not the company said hey David if we scanned the blueprint to a PDF then there are different numbers on that can you take a look at that for so the left that's me you know that I have to say that I always had a good atmosphere then and I have been doing that she's service for the company a long time and obviously I thought they were pulling my leg and so sure right yeah a cop changing numbers for here that every day of course yeah that's what I thought I heard this and they said no really really come look at
that look at that now we need this machine it has a OK so I
went back and took a look still thinking yeah and that's just prank and they have stocks work centers 75 35 and here are the 3 origin image segments before the scan so also we can read that at the top there is 14 . 13 square meters and then there's 21 . 11 square meters and at the low end there's a 17 . 4 2 right so I put the blueprint into the work center and understand we use the same place often so so far and it's quite funny right the and also all the rooms now have 14 . 1 3 square meters and no way that's impossible isn't happening now I still thought that was a prank so just to say just to say that and and I had to say there's a lot of times there's there's no are involved rather the prevalent replacement of numbers happens at the pixel level the company at another work centers the 75 56 that's larger and it's bit faster and there are many many more like this it's it's a huge family of machines on the small works into there are always the same numbers that can like see and on the big 1 every scan there were different numbers of speakers so there's more CPU power I guess and look at the numbers for example and place to with this is the role in the middle 1st we have 14 . 1 3 skirmish and next can we have 21 . 11 and and that the latter would have been the correct value so after all there is a chance that you get caught numbers so anyone know anything is a secure random number generator you can ask whether the service company and they have 1 so also and this is not this is not very funny but I'm not going to low and also the numbers look absolutely perfect in the layout and then they only notice that because the room that's obviously bigger has a smaller area when the smaller 1 next I know that the font is very small but it but is not some strange conic case we have some more examples but this is the origin example where refunded and I want child 1 to show that to you so this next
example this is a cluster registers you see here that the 6 turned into an 8 and the joke is really the later I put this image on my website and I was like Look here's 602 and next summer e-mail me a little kiss number 1 in this but again looks perfectly clean the layout and looks that looks perfect you can't really see this time we noticed there are because the numbers were sorted in ascending order consequently the numbers make no obvious sense to you you can't see there wrong you always have to have some semantic criteria that make that text plausible otherwise you don't find out so as you can see my neck gets longer and longer so I so I try directly using it like like a good computer scientists and it's so at might I made columns of numbers in different sizes and different forms and scanned them are you good
and scan them and lo and behold
I could reproduce the role of that data that yellow should be 6 and should not be the case that we see the so if we stop for a moment here I promise that I'm going to show you the entire interaction with Xerox and I'll tell you what
I felt at different points in time and every time I stressed most things that are very important if you have to find
and so the company I will also show you proof about this but I will tell you 1 thing up front in my eyes but does not help is becoming and hating publicly in particular by Twitter this is not helpful I have no problem with future at all but if we want to do something if you want to achieve something you just make yourself a target and an idiot and nobody is going to take it seriously and they will tell you that you don't really want to have a discussion because that 1 fit in 140 characters they can always say that the the only 1 that show so if you I don't want you to do these things what what what do I want you to do of the best thing is to do to not make it publicly initially so the
right units are called so we could so supports several times very often because every single support level up to top level in government and nobody knew single thing we want to present
context of love so absurd to people who were at the site they didn't know anything either so there were not really of sorts people they were just walking in commissions some distributed in all so they tried to replicate use there also and the that we are not going about
this them but but but they were standing there with their pants down in it you know imagine somebody's coming along and it looks like he's trashing your existence
subject sets so not the support company but the barracks Xerox company the there are surprised yes but that they didn't really try to help
us all the support for so such a thing you say they respect the problem you so well that's all and there were no signs of the Ganges or or any advice how
we could get rid of it and somebody came from Xerox and gave us the it didn't help but I was like all right now we know that the problem existed for years ago that well as well as well but after they haven't been a sign of progress on the side of trucks after more than 1 week I thought right that would be
enough so I wrote a blog post in German and English about what I just told you know and offered some test documents the readers could dollar to print scan and see if they were effective that's how the story
started to suspect I have to say my block is a huge not at all maybe 500 thousand readers and a that's not a lot but it's nothing either most of them
minority people that's what I know from the moment that I get 2 on the bottom of my slides you can also see it's a plot 1 of the page views it's gonna get wider and wider as I continue to speak it's a it's a sign of how much attention I'm getting at the time so so let's test that is seen more about their at the peak of 3 thousand hits per hour the numbers come from Google Analytics I've been told you're supposed to multiply them by the by the by 2 or 3 but never mind you get the idea on the 2nd and 3rd of August the stories spread on some textbooks blocks the picture is from the 1st block of and the German people in the audience probably 1 so the source but to make it more and more e-mails from people who are affected the funny thing is I also get loads of confirmation proxemics work centers have never heard of so I told you it's whole product for a family and so I'm starting to think this might be something you so lesson learned it was meant to provide the test documents right away if I hadn't people wouldn't have
been able to reproduce it for themselves and the story wouldn't have spent that much for the
forest the stories spread worldwide intercultural in the picture 2nd use you probably know it and I'm getting hundreds of e-mails from people with technical knowledge I start spending a lot of time to channel the mind getting and only that allows me to continue the story and to follow the arbitral down to the bottom and that a lot of sleep anymore because either because because I started getting lots of calls from reporters who apparently don't know that timezones actually exist lesson learnt right in multiple languages important of course English for the international readers but also the home language of whatever company you're targeting you may know that there exists a widespread in the US that there's actually a word for it it's took several and whenever something goes that they can detect would what comes next
this medium so that's where it starts to get you to this is speed it's the largest German news part of it and I click through a few that's not an exhaustive list not at all but there are thousands of articles worldwide and I'm just going
through it however it's it's much order there's no statement about published dates of the articles in as a side note in before a German journalist told me that he wanted to bring it into tortoiseshell 1 1 of the biggest German drawings and they told him yeah yeah yeah this is
a really cool but we want it to happen when you photocopy not just when you're scanning so maybe someone someone should tell them that if you printers can you need to have a photocopy and never never mind the lesson learned don't make it together and it is just because you want the attention the
economists now it gets really
serious and also expensive in terms of PR for the company and and he also can see where I stole my lecture to the from Hey
there ABC News you more expensive they receive
suddenly was everywhere in the universe feels completely surreal and if you do a shift if you pick breaks in such a situation I want if I myself getting up several nights just to prove read
my own articles and make sure I describe everything correctly that's what I did I wanted to make sure not to get sued on millions of shareholder value there right at the start of non professional life at least that's what I thought we get to that later so this is business
make it's a popular business magazine an Antonov now there was no reaction from socks and if you react that slowly it's getting pretty uncomfortable becomes a
citation on the scale of things too horrible to contemplate document altering scanner is right up there with flesh-eating bacteria that the original citation from Business Week and that's Peter colonies editor their head and we will get to him a few more times in this
presentation not know my blog post went up to 100 and run 100 wasn't readers that date and still no reaction from tourists in the meantime and managed with the help of my readers to show what actually happened and in a minute I'll tell you about that in a small sliding about image compression but before that small remark if this thing exploded I 1st used some months but not technical terms in my descriptions for example I
used pattern instead of symbols so now after that I got some units which which criticized the correct me which I am thankful for however if this happens to you don't feel discouraged by mistakes you make in the details at the time had to deal with rocks had to
write articles in German and English and I had to stormproof my internet service you get about and I had to do with the press and additionally have to understand what was happening in image compression level and just that in all at the time I had no academic interest in image image compression because I'm more the machine learning and data science kind of guy so that's a lot and normal management minds means to to not let anything catch fire and but rather something is completely different there's a lot of of fires already burning and and you know what you're do doing is running back and forth between them to just make sure none of them that's all control so stay and 1 of the 1st among you're done for so lessons learned to make sure you understand what's going on but don't lose his 72 hours no
for the image compression this is a test image I photographed both the photo and the text of part of the image so that we have different kinds of test image data and you know the data transmission is time money and storage in terms of so you don't want to transfer images uncompressed that's why there are lots of compression algorithms for images no
there's 2 parts of the test image 1 of 1 of the photo 1 of the text they're very much large so so we can look at pixel level what can go wrong with different compression algorithms so there are lot less compression and near the methods and when no information is lost or if you want to get the file even smaller than there Gnostic Codex this a
popular format this give this lossless but there's only support 265 colors so the the the the loss of information comes from the color reduction in give is good for graphics with few colors and sharp edges are preserved well so as you can see it's less suited for photos
now we have J pick which is lossy in the slice of the original picture an 8 by 8 pixel blocks and the proximity with cosine wave how exactly this this is mathematically done I won't believe it this is really good for photos but that for text and child as you can see so different compression algorithms are good for different kinds of images and more
advanced compression strategy involves slicing the original image into several sub-images we call symbols if you know the kind of imaginary every somebody can use suited confessions over them far for each of these symbols together very nice images this this basis that don't belong to a similar don't even need to be saved you probably agree this is quite an improvement compression but you can use this lysosome assembled approach in and out even more advanced sleep you can see every letter is a symbol this is actually done by the make of usually that's what is done when an image is
compressed together to form gender to isn't much storage Cordic that's that's especially suited for images from scanned text next
encoder that's what symbols are really similar to each other like
the symbols are marked here that's all small pieces that's why the and therefore only very few pixels that are different this step is called pattern matching so you can groups of all the similar symbols and for every group we saved only 1 representative symbol and that is used all over the image instead of the other group members only those only 1 of those is actually save and consume storage space and all of these are applies place where the safe when you can save lots of data and don't have to fear but image quality and here you get the result so there's
some quite nice quality and use much less they don't use good looks good right you see the that matching things that the smaller the idea is similar to that of that's what's happening when pattern matching is not exactly accurate spoken simplified data to has to usually modes of operation it can be encoded velocity in this case it works exactly like this or it can be encoded lossless in this case in an additional step errors like these are corrected before saving the image it seems like there exists a pattern matching approach an accident and left not the correction step so get these also was it well so that's dangerous mistakes no my compression artifacts are not let problematic that produce unreadable text and you can see something from the from but here you have perfectly looking layout and let us that actually encourage you really have to read them to notice and even then you don't usually notice them because if they don't make the whole document implausible on 1st glance so you just don't don't get them like the blueprint also I don't know what you do but but I don't usually proof right proofread scans after after i've produces them so politician west to put this in a positive light would say and use such exerts machine to scan the
medicine dozes of a retirement home and there's a good chance that you're you're really relieved dependence Office of from paying rent must monthly income so people this year somebody from Berlin notes or otherwise could fast and which which machine they used to chew which apply the blueprints of the airport FIL so airports medicine elections as because most are and it's all small stuff if such stands were actually used as evidence in court where that's right gets really interesting so now Husuni on which with such as good as the scanner I would just say you know that's faults and you can prove me wrong 0 can't even prove anymore that a place in the scan comes from the corresponding place in the original paper document that into legal value is 0 and that's business appliances there are hundreds of thousands of such machines each of them has maybe hundreds of users and even more people getting handed over documents by the machine so for example I was called by
a big business this can every incoming mail automatically and from this point on use the scanner so if there if there is such machines could not so we'll get back to the implement and implications later but but let's
go on with the story it's not the 1st of August 3 days after the 1st impact and then God created the life of Xerox the PI of excerpts Germany called me and was obvious they they can't they can't really do anything without the Americans they thought it was just
a prank and I said 0 no it's no joke and we agreed to stay in touch with the use of the head and they after that the 6th of august it was the 1st unwarily things started
going in the morning I got this
screenshot from the reader from physics xerox machines that uh it talks about character substitution not so there are 3 PDF compression volts there are normal higher entire very pr compatible names 1 moment is the moment that compresses the strongest there it is said in normal they are almost never and in the 2 of the stages it's not as far as I could tell at that point in time that was correctly learned that later and I promise I will tell you how often during the and to be honest honest I was a bit different I I thought I was being portrayed as the idiot who didn't read the menu so to know there was no official statement from Xerox and I was tipped off the church was about to write something like that so lesson learned that the internal you can be altered you is sort so what's the deal such a problem must never
happen not even if you know about it so but from the inside the word looks different in such a situation that so when such a sweet story happens to you guys and your partner but if you mind column from the beginning on and didn't hate you can just ask only in the public wanted support me that 2 weeks ago so I started
some some forward defense that this is not at presented the screenshot on my on my block as a possible work-around and I recommended to such compression too high and I was curious why support couldn't tell me there and I also criticized the setting was called normal and and all the
consequences remain of course and I wanted to get the story is that has been because so it's a big costs articles was about to get on my back and not exclusively the same
afternoon there's a conference call with investment and corporate vise president of Xerox and Francis which is 1 of the chief engineer and and guys that that's really something there you see extracts the boss is doing support himself sorry testing is the first one to my talk will actually does
bark at the rocks who confirms that character substitution was indeed known at Xerox so if you have a problem and call support and talk to them for like 2 weeks and they can't tell you anything that when you asked to make me up his talk to Mr. distance that and confirm it was correct but the pattern matching is responsible and they also confirmed it is only done in normal so we we agree that the 1st support for that and 2nd normal was probably not an optimal name for this setting I recommended experiment they are right right now it's a lot of fun here but at the same time I was quite a bit scared believe me and that's asking me a fairly crystal clear RTF and which is abbreviated for the fine manual
1st the normal in the factory default there they told me do customers you you also stupid ideas attitude different settings and 2nd the manual also says that led to replacement cannot use all of your all doubly stupid so what if only half the true truth for the customer course factory settings is the set up which which they get the machine and they don't usually get
different stocks themselves the 3rd party companies from which you get to the get the machines and they do consulting for you and they set them up to whatever settings as for the manual actually there is the mentioning in 1 of the men yields on page 107 of 228 and they're all enough to know how many people always read manuals 300 pages thick before the user copier I also was of the opinion that copiers must not be designed in a way that such errors can have more than 1 ever setting no 1 no one's expecting the gas and the answer was was of 0 yes that may have and the market demands the small document size the additionally I was right and and they confirmed that you can't prove that a particular document document has no parents so if if I claim it's false you can't prove me wrong overall there was quite nice atmosphere and they didn't threaten illegally and they listen to me and it was a lot of talk and and then I really let myself be caught in a trap remember I never did anything of that scales and extracts of course that professionals and after a while I was wondering why we could talk so long and in that relax atmosphere despite this in his corporate vise president of a huge company and you probably have other things to do so when not it
turned out that during I was on the phone with them and they released a press statement not stupid because that sometimes and which I can so always listening to all customers was the title that in deep field yeah good and they and they think who once they tend to needs to use a compression setting of higher higher and furthermore they say that many of painful RTF lessons learned always have someone else watch the website of the Advisory when you can create
so I also wrote an article on my back and told about the telephone conference and reported about what I just told you and I also wrote that thing they're off the hook
and no 1 that could have been the end if a single guy finds a huge company either the company should spectrum the guy caves in all the pomp public insight was that the company of the public just loses interest and nothing of that have you can see the huge spike in traffic and the number of articles on the title page of slash dot unfortunately the press
got on my side here is high is the most popular German IT portal and they they stated that I presented the work around you before statistic which is nice the and
here just because stated on soaks Xerox knew about the problem for years no if you know that something if you if you work on the for the P r of a company and something like that happens that that's really nice to
construyó holidays for the rest of the year and it gets it gets funnier and funnier if you have ever been to the US and if it really starts when it when it really starts getting lucky they say shit hits the fan the next
day my blog article was on Reddit and you can see the next spike in the plot and what is he had read it is the nicest most eloquent version of shit hits the fan that I have always been he could matter will indeed hit the rotary Arikan polar who and so on what the guy right is true if a company relies on digitization of documents and and nowadays honestly who doesn't and they all have a
problem they can close the shop really that so to give an example of a state-run archive called me and they created the entire archive with sorts machines and and what they do next well they threw away the originals theory they then I would have to look at the documents and check them for ability and and
even if they did then they couldn't be sure so if you ever see some people of an archive of staring out of the window gazing at nursing is the 1 you know way so also intended jokes or other nice this is my neck all
you know you know we are in lecture hall maybe it was supposed to be a whole 6 but then that's that's can't that you print and sometimes they're really funny jokes come from the protagonists themselves so if your corporate vise president x Xerox and have to engage give interviews on the
same topic all they something something that just will slip through you don't need to
read I just read it for you speaking to the BBC the testing wanted to relativize the issue and said OK well this is it's not really that bad yes the normal compression what can create our but but nobody really uses it example for the leader for example the military and all your expenses such so what's the
worst that can happen right there that as you probably know all the problems along the are not looked upon very kindly in the US nowadays yet so perhaps this is the right moment to say again a bit of a line officer came but but imagine yourself and
distance shoes imagine you really have to talk about this in interviews on the phone for like 14 hours it's more than human that you units said something also he told me after he had been quoted out of context and I no no reason not to believe in that but still I thought it was funny here's
a take part that is happy that kept images seem to be not affect the so as efficacy see they're not sure reportedly there and here is
also the new press release from so under the public pressure they say all yes perhaps we're going to make a pledge in which we disable the pattern matching stuff so they never admitted a mistake of problem loans understandably they have to cover the rest so what if you wait
that long and then even announcing a patch doesn't help you use a newspaper that would Teixeira like error in the title on purpose the but no that's collective exert statements there is an important and cluster crystal clear message in you
will not see a character replacement if you set compression to at least higher with at least 200 dpi they also published glossy documents saying that pattern matching is only done in the normal compression mode
and not in the highlands and I'll I quite sure I have seen the problem high on what's and end-users told me that to unfortunately I didn't manage to reproduce it on on my own devices that they had access so I'm just not putting out a new world because if it did happen in other modes and then everyone would be affected and excerpts communications have been misleading and they had a much bigger global problem that it would be a disaster but a friend of mine in 1 where I was living at the time he had work centuries 75 45 in this office and when parents can make tests and and use the higher and even 300 dpi so we actually were quite generous
and now look at this the yellow numbers have an I just I just got some of them and I don't know if I got all of them but still you can see how frequent and can occurred so I repeat that it was mode higher with 300 dpi so this
is when you find a about it's unlikely that you see the photograph of the issue at the beginning the so we not take the blue rectangle and have a
closer look you see some groups of digits that are marked the digits with indirect rectangles look absolutely identical on pixel level and that's very unlikely if it's always going to look a bit different if you scan it naturally and lots of digits that look exactly the same pixel by pixel that's a clear sign of pattern matching so despite what Xerox told us that energy was done here 1 of my readers but the nice result only isation that marks the crew digits in red their mark when I hope them with the
most I can see if I can get it to run the other
areas and now I can however when this CDC the runs those are the very same and this is the real you read usually and frequency of of of the 2 and you see there are several variants of the 2 so that's how it works and you can
see how many there are so now
on this point it is clear that hundreds of thousands of large business the lysosome
worldwide are affected in factory defaults that publishing something like that you can you can really image of the company so I didn't want to publish that without at least trying to talk to them before I'm sorry yeah I I I I I I can't I can't to earning money with that later this is this is really interesting point down don't underestimate to tell you to tell me again how I feel that this time I was there was lots of pressure so I want to make sure that I did not make any mistake and could be sued on on the loss of millions shareholder value and historical and the whole process of producing 1 wrong gets digits on video and uploaded it on public on youtube and then I sent a link to Francis the principal engineer mentioned before
and then there were the chopped inferences told me on the phone that I did everything right in my video and extracts must cooperative but they wanted me to wait until they could reproduce and they're all by themselves however I I
remember I I felt a little bit shocked over off the last conference call with them so
I said guys you know what the standards that differently so please be informed I have a blog article written on this ready there and as you can see
that they already uploaded the video to so don't take this the wrong way but but this time I wanted to keep me in the loop and that's how we agreed and there's a lot of falling back and forth and because of the time the landlords are literally spend the nite in the office and only had some biscuits and finally Francis Collins mentors me if we have reproduced at character replacement and factory defaults everyone was a big the and you know what they told me later the code for the compression kernel was 8 years old at the the time this practice is in the way that while it has been in the wild and since 2006 and with that being the case that we were all of that stuff so the excessive have looked at my article and make sure that I would like to thank the so I push of the I think that's like this really dangerous and and I don't want to wait longer and and that they digits and even
and even a lot me to publish it before they published something to and that's why we shouldn't hate that lesson learned to negotiate at the right moment right after what's
so export their own state and they they take back the previous communications and even said thank you and wrote the road then I will have to investigate how big this issue is and that's where they started to be really nice and the press statements and the whole climate was the reconstruction
in the press because of all the flip-flopping the whole thing becomes more and more serious business let's start again look at the title apparently to them it doesn't matter what stock status but only when they confirmed to me here again is Peter call from
Business Week and then I got some 1 more for you 1 more for you I mean no 1 1 compression never mind knowledge
at the 11th of august I can actually prove that it happens on highest mode as well so so even people willing to create create massively beautiful pdfs and couldn't escape and to be honest as far as I know only doesn't happen if you can to to
on the 12th of august shorts confirms itself about announces the patch another time in the middle of the nite my time this tendency called me on my Mobile to be the 1st to tell me that they can't the buck back and they hear rolling out use of that to all devices uh from from this we can see how nice the atmosphere becomes that's the
patched on the side from Xerox and here we can see for the 1st time how many devices are affected by this the end and look at all the excess of device families and it took into the 22nd of August onto the 1st pictures were released and you if if you think that's long
I think different I do the release engineering admired unemployed by the you and I can tell you that running all testing and use of release for such a large and long existing device somebody is really really trying to you was also there were quite fast and the next
stage the process reporting again and for example the German computer magazine CT has a report and cause the whole thing scanner gate and here political now
puts the and this stochastic but Peter completely right 8 yearly productions of scanned documents across tens of thousands of enterprises worldwide can contain these errors and and damage things by and to
understand the full monstrosity of this uh reflect on the time we are living in we living in society that right now now as we speak is doing its since transition from paper to digital and the translator between the words of I devices like centers we have we have to we have to put up with this a long time In computer science them slowly grows out of its infancy and as you can see computer scientists have a social responsibility in their own way and in particular way more than they think that and here comes the most important thing of the talk I told you Csirik so it's most of its devices through 3rd parties that they told me they don't even can collect the complete list of their customers to notify them so what we can know if that's the right but in any case there's no reason to believe the patches have reached too many devices at all so pleased that the words the words for example a few weeks of all I was at a Boston and I know some folks working at MIT and I was told that there MIT hasn't patch that it so now I know that I give this talk in English finally I hope it gets viral enough that there might you can get the message to perhaps you you can all help and besides all the
lessons learned there's 1 little effort told you don't always get unbelieving looks when I when I tell people I hadn't and of the signal sent from sorts the manager even told me I was an idiot so I know give you the information you can decide for for yourself how much of an idiot I am and 2 things firstly it's really hard to earn money with something like this Enterprise of exercise get threatened by some track of every single day and without proof people are not going to take you seriously and and only if you provide prove it mostly directly
units to the back no money again and secondly corporations don't no friends if I had asked for money this would have come to light and it would have looked sleazy regardless of the money or not I would have been shut down with and
maybe most important of i've been plagued by TSA rocks I would never have had negotiation position that allowed me to actually demand a solution asked for getting Martica's proof that so last but not least and lots and lots of people all across the world of me and they didn't ask for compensation either and I would I would do it again like this but at the end of the day everybody has to decide for themselves if you want to do differently that's perfectly fine but know you've been here and and you know in before you might make new negotiation position and these are all the lessons
learned and not going to go through them again but if you don't the slide here there are and in melanoma we don't know jump to the present more precisely too much this year and again
you don't need to read that I'll explain it to this the website of the German Federal Office for Security in information technology they are in charge of managing IT security stuff for for the German government as the best busy as a consequence of stocks are around this March they declared all spins using pattern matching and related technologies has not major legally safe this includes apologetic 2 scans but even then the lossless version of genetic to encoded images the effective independent from the lecturer manufacturer of the scanner in other words this affects all manufacturers so this is an even bigger than it sounds as far as I know the authorities in Germany have to follow the scanning guidance of this Federal Office as well as lots of enterprises that have to do with the government stuff and it sounds like they have to rescan so in a sense we are creating jobs here there's a similar system institution that made likewise that decision the German one the Germans are not the only ones so OK what what remains Bach Obama's
birth certificate here it is again shortly after the external things journalists from reality check in the U. S. you see the link and asked me but if the church but might have been the cause for the phenomenon observed in the birth certificate and they don't detective work already for instances for
instance the Obama said published the textbook right before the birth certificate it had been scanned with work centers 76 55 and I wonder if I get the numbers right every time and they asked me if I could ask Xerox
because because I you know knowledge connection and understand the research they know already there it will be really that different things and all lines right now the sign at the end of 2014 I look back at the birth certificate
PDF and have a look at this the PDF contains except duplicative of character who assigned for uncorrected pattern matching in Excel oxide and when you look on the internet pages of the of the conspirational period they they also say something like duplicate characters and then they thought they were copied on purpose in order to flush that certificate and and
here for example there are 2 boxes that look exactly the same I mean make make up your mind but but I think this certain for a theory might here by the shut down and all I have to
say no thank you for spending this all with me and you find the link to that stocks Freeze he spread the words and and later I published must slides my webpage as well and last a police congratulations frost and it's amazing what you did the 10 years thank you all any
questions yes so good but it was very good yeah I can go back and you will I'm I'm getting to the
sled you mean the image with the the 1 of the 4 here uh so
what was it was black and the other is it's because they tried to to and get the the characters of the background and the ones that they
isolated from the background of perfectly sharp in in a separate layer which then is compressed by genetic to so the 1 that they didn't get uh they didn't get it out of the ground the 1 Mr. background that splurge this is the technique and that's slide is actually on the correct another question None of them
thanks again what you can do
Freeware
Presentation of a group
Open source
Virtual machine
Coma Berenices
Bit
Theory
Message passing
Video game
Computer animation
Oval
Personal digital assistant
Computer science
Information security
Metropolitan area network
Context awareness
Standard deviation
Process (computing)
Computer animation
Term (mathematics)
Theory
Right angle
Public key certificate
Summation
Computer animation
Information
Fitness function
Theory
Right angle
Public key certificate
Form (programming)
Theory
Rule of inference
Medical imaging
Pixel
Computer animation
Graph coloring
Gradient
Noise
Line (geometry)
Subtraction
Connected space
Pixel
State of matter
State of matter
Mereology
Public key certificate
Medical imaging
Computer animation
CNN
Energy level
Cuboid
Row (database)
Moving average
Right angle
Physical system
Slide rule
Email
Computer animation
Computer network
Letterpress printing
Electronic mailing list
Virtual machine
Belegleser
Local Group
Metre
Area
Service (economics)
Computer animation
Multiplication sign
Square number
Virtual machine
Cuboid
Probability density function
Number
Area
Metre
Pixel
Service (economics)
Random number generation
Multiplication sign
Virtual machine
Bit
Computer font
Number
Power (physics)
Medical imaging
Tablet computer
Computer animation
Personal digital assistant
Square number
Energy level
Right angle
Information security
Subtraction
Family
Medical imaging
Email
Goodness of fit
Computer animation
Multiplication sign
Order (biology)
Computer science
Website
Subtraction
Number
Form (programming)
Point (geometry)
Multiplication sign
Moment (mathematics)
Interactive television
Fitness function
Entire function
Twitter
Proof theory
Number
Computer animation
Personal digital assistant
Software testing
Subtraction
Context awareness
Existence
Computer animation
Single-precision floating-point format
Multiplication sign
Energy level
Website
Quicksort
Units of measurement
Sign (mathematics)
Computer animation
Online help
Set (mathematics)
Arithmetic progression
Web page
Reading (process)
Slide rule
Greatest element
Multiplication sign
Source code
Number
Sign (mathematics)
Blog
Acoustic coupler
Software testing
Rule of inference
Product (category theory)
Block (periodic table)
Structural load
Moment (mathematics)
Term (mathematics)
Plot (narrative)
Number
Computer animation
Uniform resource name
Blog
Hydraulic jump
Family
Design of experiments
Mapping
Multiplication sign
Home page
Bit
Airfoil
Emulation
Formal language
Forest
Logic
Moving average
Physical law
E-learning
Maize
Traffic reporting
Library (computing)
Surjective function
Newton's law of universal gravitation
Metropolitan area network
Time zone
Raw image format
Email
Multiplication
Process capability index
System call
Inflection point
Number
Software development kit
Word
Computer animation
4 (number)
Uniform resource name
System on a chip
Boom (sailing)
Computer cluster
Kerr-Lösung
Right angle
Units of measurement
Computer animation
Order (biology)
Electronic mailing list
Statement (computer science)
5 (number)
Mereology
Shift operator
Video game
Computer animation
Term (mathematics)
Mathematics
Universe (mathematics)
Belegleser
Reading (process)
Scale (map)
Scaling (geometry)
Presentation of a group
Multiplication sign
Online help
Disk read-and-write head
Medical imaging
Computer animation
Term (mathematics)
Data compression
Blog
Text editor
Belegleser
Text editor
Right angle
Descriptive statistics
Wide area network
Scale (map)
Game controller
Multiplication sign
Symbol table
Medical imaging
Machine learning
Computer animation
Data compression
Internet service provider
Energy level
Normal (geometry)
Pattern language
Belegleser
Text editor
Right angle
Data management
Units of measurement
Pixel
Satellite
Computer file
Information
Multiplication sign
Computer-generated imagery
Heat transfer
Mereology
Data transmission
Medical imaging
Digital photography
Computer animation
Data compression
Term (mathematics)
Data storage device
Data compression
Energy level
Software testing
Software testing
Subtraction
Pixel
Information
File format
Block (periodic table)
Insertion loss
Distance
Trigonometric functions
Medical imaging
Digital photography
Wave
Computer animation
Graph coloring
Data compression
Program slicing
Data compression
Reduction of order
Subtraction
Flow separation
Symbol table
Medical imaging
Strategy game
Data compression
Data storage device
Gender
Data compression
Symbol table
Medical imaging
Pixel
Spacetime
Data storage device
Data compression
Code
Pattern language
Symbol table
Resultant
Local Group
Addition
Satellite
Virtual machine
Proof theory
Medical imaging
Goodness of fit
Pattern matching
Computer animation
Personal digital assistant
Velocity
Data compression
Operator (mathematics)
Data compression
Pattern language
Right angle
Office suite
Belegleser
Error message
Asynchronous Transfer Mode
Relief
Point (geometry)
Email
Satellite
Virtual machine
Disk read-and-write head
Video game
Sign (mathematics)
Video game
Computer animation
Touch typing
Data compression
5 (number)
God
Point (geometry)
Multiplication sign
Computer file
Computer-generated imagery
Moment (mathematics)
Physicalism
6 (number)
Bit
Substitute good
Subset
Error message
Computer animation
Data compression
Computer network
Statement (computer science)
Quicksort
Probability density function
Word
Computer animation
Data compression
Block (periodic table)
Mathematics
6 (number)
Set (mathematics)
Boss Corporation
Pattern matching
Computer animation
Multiplication sign
Principal ideal
6 (number)
Software testing
Bit
Set (mathematics)
Distance
System call
Substitute good
Web page
Default (computer science)
Scaling (geometry)
Inheritance (object-oriented programming)
Euler angles
Principal ideal
Virtual machine
1 (number)
6 (number)
3 (number)
Set (mathematics)
Computer animation
Factory (trading post)
Subtraction
Error message
Computer-generated imagery
Lemma (mathematics)
6 (number)
Set (mathematics)
Interface (computing)
Field (computer science)
Goodness of fit
Computer animation
Hooking
Data compression
Statement (computer science)
Factory (trading post)
Website
Energy level
Message passing
Wide area network
Data integrity
World Wide Web Consortium
Default (computer science)
Web page
Asynchronous Transfer Mode
Statistics
Computer animation
Venn diagram
Uniform resource name
Data compression
Spectrum (functional analysis)
Number
Revision control
Internetworking
Computer animation
Blog
Workstation
Right angle
Digitizing
Hand fan
Plot (narrative)
Information management
Internetworking
Computer animation
DDR SDRAM
File archiver
Virtual machine
Weight
Process (computing)
Quicksort
Entire function
Theory
Window
Raw image format
Manufacturing execution system
Link (knot theory)
Model theory
Moment (mathematics)
Coma Berenices
Client (computing)
Bit
Caustic (optics)
Line (geometry)
Euler angles
Maxima and minima
Dressing (medical)
Number
Internetworking
Computer animation
Data compression
Electronic meeting system
Dew point
Moving average
Software testing
Office suite
Units of measurement
Modem
Context awareness
Patch (Unix)
Computer-generated imagery
Distance
Mereology
Medical imaging
Number
Computer animation
Uniform resource name
Software
Pattern language
Belegleser
Pressure
Units of measurement
Computer-assisted translation
Newton's law of universal gravitation
Reading (process)
Freeware
Patch (Unix)
Multiplication sign
Online help
Pattern matching
Arithmetic mean
Data compression
Software
Factory (trading post)
Office suite
Error message
Default (computer science)
Inheritance (object-oriented programming)
Patch (Unix)
Disk read-and-write head
Maxima and minima
Data mining
Number
Message passing
Computer animation
Telecommunication
Field (mathematics)
Data compression
Statement (computer science)
Normal (geometry)
Asynchronous Transfer Mode
Pixel
Rectangle
Food energy
Local Group
Number
Summation
Number
Digital photography
Sign (mathematics)
Computer animation
Factory (trading post)
Energy level
Pattern language
Digitizing
Subtraction
Resultant
Asynchronous Transfer Mode
Area
Point (geometry)
Metropolitan area network
Default (computer science)
Raw image format
Nim-Spiel
Process (computing)
Multiplication sign
Insertion loss
Medical imaging
Number
Computer animation
Linker (computing)
Factory (trading post)
Computer cluster
Videoconferencing
Factory (trading post)
Gamma function
Pressure
Digitizing
Vacuum
YouTube
Default (computer science)
Standard deviation
Code
Multiplication sign
System call
Inference
Number
Loop (music)
Kernel (computing)
Personal digital assistant
Data compression
Blog
Factory (trading post)
Videoconferencing
Factory (trading post)
Right angle
Office suite
Digitizing
State of matter
Moment (mathematics)
Length
Flip-flop (electronics)
Term (mathematics)
Ordinary differential equation
System call
Storage area network
Summation
Number
Computer animation
Lie group
Uniform resource name
Telecommunication
Statement (computer science)
Factory (trading post)
Software testing
Right angle
Statement (computer science)
Game theory
Wide area network
Mobile Web
Asynchronous Transfer Mode
Software bug
12 (number)
Patch (Unix)
Multiplication sign
Patch (Unix)
Control flow
Number
Computer animation
Data compression
Software
Factory (trading post)
Motion blur
Software testing
Belegleser
Bus (computing)
Escape character
Asynchronous Transfer Mode
Process (computing)
Computer animation
Patch (Unix)
Multiplication sign
Software testing
Gamma function
Family
Logic gate
Computer
Traffic reporting
Enterprise architecture
Group action
Product (category theory)
Multiplication sign
Patch (Unix)
Electronic mailing list
Translation (relic)
10 (number)
Number
Message passing
Word
Computer animation
Personal digital assistant
Computer science
Dependent and independent variables
Error message
Proof theory
Trail
Enterprise architecture
Computer animation
Information
Single-precision floating-point format
Quicksort
Data management
Units of measurement
Slide rule
Software bug
Presentation of a group
View (database)
Moment (mathematics)
Covering space
Formal language
Twitter
Data management
Proof theory
Computer animation
Software testing
Position operator
Hydraulic jump
Software engineering
Decision theory
1 (number)
Similarity (geometry)
Public key certificate
Word
Medical imaging
Pattern matching
Causality
Linker (computing)
Authorization
Inverter (logic gate)
Office suite
Belegleser
Information security
Physical system
Metropolitan area network
Enterprise architecture
Information
Coma Berenices
Maxima and minima
Word
Process (computing)
Computer animation
Website
Units of measurement
Web page
Multiplication sign
Coma Berenices
Instance (computer science)
Line (geometry)
Public key certificate
Number
Connected space
Word
Frequency
Sign (mathematics)
Computer animation
Internetworking
Uniform resource name
Order (biology)
Right angle
Pattern language
Probability density function
Web page
Word
Slide rule
Slide rule
Computer animation
Linker (computing)
Cuboid
Coma Berenices
Theory
Freezing
Metropolitan area network
Medical imaging
Slide rule
Computer animation
Meeting/Interview
Uniform resource name
1 (number)
Equation of state
Computer animation
Meeting/Interview

Metadata

Formal Metadata

Title Lies, damned lies and scans
Title of Series FrOSCon 2015
Part Number 34
Number of Parts 80
Author Kriesel, David
License CC Attribution 3.0 Unported:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
DOI 10.5446/19565
Publisher Free and Open Source software Conference (FrOSCon) e.V.
Release Date 2015
Language English

Content Metadata

Subject Area Information technology
Abstract On the scale of things too horrible to contemplate, "document-altering scanner" is right up there with "flesh-eating bacteria". Since 2006, Xerox scancopiers literally are making stuff up. They, for example, replace digits with others in scans. The replacement digits are layouted perfectly into the page, so the errors are hard to see. Sounds unbelievably insidious, but it's true. Drug prescriptions, construction plans, just anything can be affected. David Kriesel

Recommendations

Loading...
Feedback
AV-Portal 3.5.0 (cb7a58240982536f976b3fae0db2d7d34ae7e46b)

Timings

  776 ms - page object