captcha

29
Prabhakar Verma “08MC30” CAPTCHA

Upload: jacoba

Post on 22-Feb-2016

52 views

Category:

Documents


0 download

DESCRIPTION

CAPTCHA. Prabhakar Verma “08MC30”. Introduction. A CAPTCHA is a program that protects websites against bots by generating and grading tests that humans can pass but current computer programs cannot. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CAPTCHA

Prabhakar Verma“08MC30”

CAPTCHA

Page 2: CAPTCHA

Introduction

A CAPTCHA is a program that protects websites against bots by generating and grading tests that humans can pass but current computer programs cannot.

Page 3: CAPTCHA

A CAPTCHA  is a type of challenge-response test used in computing to ensure that the response is not generated by a computer.

CAPTCHA requires that the user type letters or digits from a distorted image that appears on the screen.

Page 4: CAPTCHA

Characteristics

A CAPTCHA is a means of automatically generating new challenges which:•Current software is unable to solve accurately.•Most humans can solve•Does not rely on the type of CAPTCHA being new to the attacker.

CAPTCHAs rely on difficult problems in artificial intelligence.

Page 5: CAPTCHA

Origin

First developed by Alta Vista in 1997.

The term coined in 2000 by Luis von Ahn , Manuel Blum and Nicholas J. Hopper of Carnegie Mellon University and John Langford of IBM.

Primitive CAPTCHAs seem to have been developed in 1997 by Andrei Broder, Martin Abadi, Krishna Bharat, and Mark Lillibridge to prevent bots from adding URLs to their search engine.

Page 6: CAPTCHA

Turing Test

Proposed by Alan Turing.

To test a machine’s level of intelligence Human judge asks questions to two participants, one is a machine, he doesn’t know which is which, If judge can’t tell which is the machine, the machine passes the test.

CAPTCHA employs a reverse Turing test,judge = CAPTCHA program,

participant = user if user passes CAPTCHA, he is human

if user fails, it is a machine

Page 7: CAPTCHA

1. Text Based CAPTCHAs

2. Graphics Based CAPTCHAs

3. Audio or Sound Based CAPTCHAs

Types of CAPTCHAs

Page 8: CAPTCHA

Typically relay on sophisticated distortion of text images rendering them unrecognizable to the state of the art of the pattern recognition programs but recognizable by humans.

Examples:

Simple, normal language questions: What is sum of three and thirty-five? If today is Saturday, what is day after

tomorrow? Very effective, needs a large question bank Cognitively challenged users find it hard .

Text Based CAPTCHAs

Page 9: CAPTCHA

Gimpy:

Originally designed by Yahoo and CMU.Based on human ability to read heavily distorted

and corrupted text.works by choosing a certain number of words from

a dictionary, and then displaying them corrupted and distorted in an image; after that Gimpy asks the user to type the words displayed in that image.

Page 10: CAPTCHA

EZ-Gimpy:A modified version of Gimpy.Used in Yahoo Messenger Service.It contains only one random character string.The word is random and not picked from the dictionary.Its not a good implementation of CAPTCHA, and already broken

OCRs.

Page 11: CAPTCHA

MSN Passport service CAPTCHAs:ts provided for Microsoft MSN services.uses 8 characters. Warping is used to distort.Its very strongly implemented and hasn’t been

broken.

Page 12: CAPTCHA

Requires user to perform image recognition test.

Graphics Based CAPTCHAs

Page 13: CAPTCHA

IMAGINATION:CAPTCHA that requires two steps to be passed. first step visitor clicks elsewhere on the picture that

composed of a few images and selects in this way a single image.

second step the selected image is loaded. It is enlarged but very distorted. Also variants of the answer are loaded on the client side. The visitor should select a correct answer from the set of the proposed words.

Page 14: CAPTCHA

BONGO:

After M.M.Bongard, pattern recognition expert.User has to solve a pattern recognition problem.

Page 15: CAPTCHA

ASSIRA:

Animal Species Image Recognition for Restricting Access.

It’s a HIP that works by asking users to identify photographs of cats and dogs.

Difficult for computers but humans can accomplish it very quickly and accurately.

Page 16: CAPTCHA

Require user to solve a speech recognition test. In this version of captcha letters are read aloud instead of being displayed in an image.Helps visually disabled usersBelow is the Google’s audio enabled CAPTCHA.

Audio CAPTCHAs

Page 17: CAPTCHA

3DCaptcha is the "captcha nice to humans, bad to machines".It is written in PHP. A new approach to captchas, using human's spatial cognition abilities to differentiate humans from machines.It uses a markov-chain to generate words that resemble human language and are easy to type, yet avoid dictionary lookups.It filters profane language.It's easy to deploy.

3D CATCHA

Page 18: CAPTCHA

Free CAPTCHA service that helps to digitize books, newspapers and old time radio shows.reCAPTCHA improves the process of digitizing books by sending words that cannot be read by computers to the Web in the form of CAPTCHAs for humans to decipher. Each word that cannot be read correctly by OCR is placed on an image and used as a CAPTCHA.This is possible because most OCR programs alert you when a word cannot be read correctly.

Re-CAPTCHA

Page 19: CAPTCHA

Working of reCAPTCHA:Two words are shown, one word is known as Control Word, and another one is known a questionable word.System assumes that if human types the control word correctly, the questionable word is also correct.The identification performed by each OCR program is given a value of 0.5 points, and each interpretation by a human is given a full point.Once a given identification hits 2.5 votes, the word is considered called.

Page 20: CAPTCHA

Applications

1. Preventing Comment Spam in Blogs

2. Protecting Website Registration

3. Protecting Email Addresses From Scrapers

4. Online Polls5. Preventing Dictionary Attacks6. Search Engine Bots7. Worms and Spam

Page 21: CAPTCHA

Called Hard-AI problems. CAPTCHA tests are based on open problems in artificial intelligence (AI). A win-win scenario:

either a CAPTCHA is not broken and there is a way to differentiate humans from computers.

Or the CAPTCHA is broken and an AI problem is solved.Thus AI knowledge is advanced if CAPTCHAs are broken.

Advancing Artificial Intelligence

Page 22: CAPTCHA

Things to keep in mind: Don’t store CAPTCHA solution in Web page’s

metadata A CAPTCHA is no good if it doesn't distortNeed a large database of different CAPTCHA

questions Avoid repetition of questions

Constructing CAPTCHAs

Page 23: CAPTCHA

CAPTCHA Logic:

Generate the question Persist the correct answer Present the question to user Evaluate answer, if incorrect, start again-- Generate a

different CAPTCHA If correct, allow access to user

Page 24: CAPTCHA

GUIDELINES:

Accessibility Image security Script security Security after widespread adoptionCustom implementation or a general CAPTCHA?

Page 25: CAPTCHA

Cracking CAPTCHAs through programs

Convert CAPTCHA into greyscale Detect patterns in the image corresponding to

characters Or, read session files of that user and know the

CAPTCHA word Solution: Only store a hash of the CAPTCHA word

in session files

Breaking CAPTCHAs

Page 26: CAPTCHA

Usability issues: W3C mandates Web to be accessible to all people Some CAPTCHAs are inaccessible to visually

impaired, cognitively challenged people

Compatibility issues: JavaScript may need to be activated in browsers Some may need Adobe Flash plugin installed

Issues with CAPTCHAs

Page 27: CAPTCHA

Real World

Page 28: CAPTCHA

CAPTCHAs are an effective way to counter bots and reduce spam

They serve dual purpose– help advance AI knowledge Applications are varied– from stopping bots to

character recognition & pattern matching Some issues with current implementations represent

challenges for future improvements

Conclusion

Page 29: CAPTCHA

PRABHAKAR VERMA