JEE Joint Seat Allocation: Synthetic Data Generation
IIT Madras Team
Course Preference-Data Generation Engine
[Flowchart components: JEE(Main) candidates (5 lakh); Category Selector & Gender Selector; Advanced Rank Generator; Choice Generator, with a lookup table of IIT and NIT closing ranks; Final Data]
Category Selector
• Assign each candidate one of the GEN/OBC/SC/ST categories.
• Assign PwD status where applicable.
• Assign gender after the category has been allocated.
• For JEE(Adv), a third-gender provision is included (if such candidates have any course restrictions, the program can be tuned accordingly).
In the CML rank list, the selector ensures approximately 66% GEN, 30% OBC, 3% SC, and 0.8% ST, with PwD status set appropriately.
In addition, candidates are listed in the appropriate category rank lists, where applicable.
These percentages are based on JEE(Adv) 2013 and 2014 data.
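The category selection step can be sketched as weighted random sampling. This is a minimal illustration, not the team's implementation: the category weights are the approximate shares quoted on the slide, while the PwD probability, the gender labels and weights, and the function name are hypothetical placeholders, since the slides give no values for them.

```python
import random

# Approximate category shares quoted on the slide (they sum to 99.8;
# random.choices normalises the weights automatically).
CATEGORY_WEIGHTS = {"GEN": 66.0, "OBC": 30.0, "SC": 3.0, "ST": 0.8}

# Hypothetical placeholders: the slides give no PwD or gender shares.
PWD_PROBABILITY = 0.01
GENDER_WEIGHTS = {"Male": 0.80, "Female": 0.19, "Third gender": 0.01}


def assign_attributes(n_candidates, seed=None):
    """Return one record per candidate with category, PwD flag, and gender."""
    rng = random.Random(seed)
    categories = rng.choices(
        list(CATEGORY_WEIGHTS), weights=list(CATEGORY_WEIGHTS.values()), k=n_candidates
    )
    records = []
    for main_rank, category in enumerate(categories, start=1):
        # PwD is decided next; gender is assigned only after the category.
        is_pwd = rng.random() < PWD_PROBABILITY
        gender = rng.choices(
            list(GENDER_WEIGHTS), weights=list(GENDER_WEIGHTS.values()), k=1
        )[0]
        records.append(
            {"main_rank": main_rank, "category": category, "pwd": is_pwd, "gender": gender}
        )
    return records


candidates = assign_attributes(20_000, seed=1)
gen_share = sum(c["category"] == "GEN" for c in candidates) / len(candidates)
```

With a large enough sample, `gen_share` should land close to the 66% target; the same check can be repeated for the other categories.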
Advanced Rank Generator
• Start with JEE Advanced rank = JEE Main rank.
• Randomly choose a candidate and pick a neighbour at an offset of at most 20 from its Main rank.
• Swap their Advanced ranks.
• Repeat this 5,00,000 × 20 times.
The first 20,000 candidates are allotted Advanced ranks, taking all categories into account.
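The swap procedure can be sketched as follows. This is an illustrative reading of the steps, not the team's code: the function and parameter names are hypothetical, and the sketch assumes the offset is drawn uniformly from 1 to 20 in either direction and that a swap is skipped when the neighbour would fall outside the list.

```python
import random


def generate_advanced_ranks(n_candidates, max_offset=20, swaps_per_candidate=20, seed=None):
    """Return a list where entry i is the Advanced rank of the candidate with Main rank i + 1."""
    rng = random.Random(seed)
    # Step 1: Advanced rank starts out equal to Main rank.
    adv = list(range(1, n_candidates + 1))
    # Step 4: repeat the swap n_candidates * swaps_per_candidate times.
    for _ in range(n_candidates * swaps_per_candidate):
        # Step 2: pick a candidate and a neighbour at most max_offset away.
        i = rng.randrange(n_candidates)
        j = i + rng.randint(1, max_offset) * rng.choice((-1, 1))
        if 0 <= j < n_candidates:  # assumption: skip out-of-range neighbours
            # Step 3: swap their Advanced ranks.
            adv[i], adv[j] = adv[j], adv[i]
    return adv


# Scaled-down run; the slides use 5,00,000 candidates.
adv_ranks = generate_advanced_ranks(1_000, seed=7)
```

Because every step is a swap, the result is always a permutation of the Main ranks. Each individual swap is local, which keeps the Advanced ranks positively correlated with the Main ranks, although repeated swaps can move a candidate more than 20 places overall.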
JEE(Main) and JEE(Adv) Ranking
Analysis of the data indicates a positive correlation between the two ranks.
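One way to quantify such a correlation between two rankings is Spearman's rank correlation coefficient. The slides do not say which measure was used, so the choice of Spearman's coefficient, the function name, and the five-candidate toy data below are assumptions for illustration only.

```python
def spearman_rho(ranks_a, ranks_b):
    """Spearman's coefficient for two tie-free rankings of the same candidates.

    Uses rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), where d is the
    per-candidate difference between the two ranks.
    """
    n = len(ranks_a)
    if n != len(ranks_b) or n < 2:
        raise ValueError("need two rankings of equal length >= 2")
    sum_d2 = sum((a - b) ** 2 for a, b in zip(ranks_a, ranks_b))
    return 1 - 6 * sum_d2 / (n * (n * n - 1))


# Toy example: Main ranks 1..5 and a lightly shuffled set of Advanced ranks.
main_ranks = [1, 2, 3, 4, 5]
adv_ranks_toy = [2, 1, 3, 5, 4]
rho = spearman_rho(main_ranks, adv_ranks_toy)  # sum(d^2) = 4, so rho = 1 - 24/120 = 0.8
```

A value near 1 indicates that candidates keep roughly the same order in both lists; a value of 1 means identical order and -1 means fully reversed order.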
Choice Generator
[Diagram: analysis of JEE(Adv) 2013 and 2014 data; labels: Category, Popular Courses (based on closing ranks), Rank]
Candidates' choice-filling strings are modelled as Ambitious, Rational, Safe, or Random.
Number of programs at IITs/NITs: 217/612.
The Choice Generator is highly tunable and flexible.
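The slides name four choice-filling profiles but do not define them, so the sketch below is only one possible interpretation. Every rule in it is an assumption: "ambitious" lists the most competitive programs regardless of rank, "rational" lists programs whose closing rank is at or beyond the candidate's rank, "safe" adds a margin via a hypothetical `safety_factor`, and "random" samples uniformly. The program names and closing ranks are made-up toy values.

```python
import random


def generate_choices(rank, closing_ranks, profile, length=5, safety_factor=1.5, seed=None):
    """Return an ordered list of program choices for one candidate (illustrative rules only)."""
    rng = random.Random(seed)
    # Most competitive programs (smallest closing rank) first.
    programs = sorted(closing_ranks, key=closing_ranks.get)
    if profile == "ambitious":
        pool = programs  # assumption: aim for the top programs regardless of rank
    elif profile == "rational":
        pool = [p for p in programs if closing_ranks[p] >= rank]
    elif profile == "safe":
        pool = [p for p in programs if closing_ranks[p] >= rank * safety_factor]
    elif profile == "random":
        return rng.sample(programs, min(length, len(programs)))
    else:
        raise ValueError(f"unknown profile: {profile}")
    return pool[:length]


# Toy lookup table of closing ranks (hypothetical programs and values).
toy_closing_ranks = {"A": 100, "B": 500, "C": 2000, "D": 8000}

ambitious = generate_choices(400, toy_closing_ranks, "ambitious", length=2)  # ['A', 'B']
rational = generate_choices(400, toy_closing_ranks, "rational", length=2)    # ['B', 'C']
safe = generate_choices(400, toy_closing_ranks, "safe", length=2)            # ['C', 'D']
shuffled = generate_choices(400, toy_closing_ranks, "random", length=2, seed=3)
```

Parameters such as `length` and `safety_factor` are the kind of knobs that would make a generator of this sort tunable; the actual tuning options of the team's generator are not described in the slides.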
Snapshot of the Final Data
Comparative Analysis of JEE(Adv) 2013/2014 data with Synthetic data
Percentage variation in number of OBC candidates in UR List
The number of candidates between any two percentage levels is the same in the data for the last two years.
Percentage variation in number of SC candidates in UR List
The percentage of candidates at each level in the UR list is nearly the same in both years.
Percentage variation in number of ST candidates in UR List
The pattern is nearly the same for both years.
Course Preferences at Different Rank Levels with only IIT
• The results indicate the popularity of courses at different ranks, irrespective of category.
• The data are based on first preferences.
Course Preference: Synthetic Data with NIT Engine vs. JEE(Adv) 2013
Highlights
• Ranks for 5 lakh JEE(Main) candidates, along with their category and gender, are generated.
• JEE(Adv) ranks for seat filling, accommodating all categories, are generated.
• Gender classification is carried out, with a provision for a third gender.
• Course choice filling is carried out for IITs (20,000 candidates) and NITs (5 lakh candidates, inclusive of JEE(Adv) candidates).
• The candidates' choice-filling strings for IITs and NITs are modelled as Ambitious, Rational, Safe, or Random.
• The synthetic data resembles the last few years' JEE(Adv) data in terms of the distribution of category candidates in the CML, course choices, length of the choice string, etc.