How to calculate your ATAR from trial/past paper marks (4 Viewers)

Joined
Sep 9, 2022
Messages
52
Gender
Male
HSC
2023
There's a really common misconception that you can put trial marks straight into ATAR calculators - and if you've done this, you'll probably notice a really disappointing ATAR that's much lower than what you expected. The reason for this is that ATAR calculators are actually designed for 'aligned' marks, which are the marks you get on the actual HSC results day. These aligned marks are entirely different to your raw mark in the exam, which aren't released unless you pay NESA $60 per subject. I won't go into too much detail, but basically alignment usually increases your marks (sometimes by as much as 30 marks!) to match NESA's band descriptors, but the long and short of it is that you can't use normal ATAR calculators with your trial/past paper marks.

To fill this hole I've been developing my own ATAR calculator, which allows you to input your raw marks (could be trial or past paper marks), converts them into aligned marks using extrapolated formulas based on data from https://rawmarks.info, and then follows a similar process to other ATAR calculators to convert your aligned marks into an ATAR.

https://docs.google.com/spreadsheets/d/1btYJ_CrnBl9mKI4jKoRW3ALxHKiKamtU5UtGawmbeOg/edit?usp=sharing
(you'll have to make a copy to use it)

DISCLAIMER: all data is based on the 2022 HSC unless indicated otherwise with italics or an *. If your trial was easier/harder than last years HSC then I obviously can't account for it, but you could just enter in a slightly lower/higher mark if you wish to - The ideal use for this calculator would be actually doing all of the 2022 HSC as past papers for near-perfect accuracy, but obviously that's a bit unrealistic. The other factor is that this is just an estimate of where you're at right now, and obviously things will change (usually for the better) over time, and your moderated School Assessment Mark will also be a factor in your marks.

DISCLAIMER 2: Just to clarify since people have been asking, this calculator will produce different results to UAC ATAR Compass, that is intended behaviour. The calculators are fundamentally different, ATAR Compass is intended for aligned marks while this calculator is for raw marks.

I hope this is helpful :)

Technical details (stop reading here if you're not interested)
Generally, the way the calculator itself functions is fairly simple - it's hidden normally but it's all visible if you open the "advanced" mode. The methodology for development was a little more interesting. This explanation will hopefully be helpful for anyone who wishes to understand how this calculator works or undertake a similar project in the future (and I just want to talk about it).

For the raw->aligned mark conversion, I found through research that the process of alignment is piecewise linear - NESA sets band cutoffs based on the band descriptors, and then linearly interpolates between marks (source here, also confirmed through my own testing). The data from https://rawmarks.info for many subjects includes multiple marks within each band range, and so I plotted all of the 2022 marks for each subject in desmos, and hence found the (linear) equation which resolved correct rounded aligned marks and matched to an exact rounded band cutoff for each available band (example), and then beyond available data I simply used a linear equation from 0 to the lowest known band cutoff. The accuracy of this method is dependent on the subject - for subjects with lots of raw marks data (e.g. most Mathematics subjects, English Advanced, Chemistry, Physics and Economics) this results in a perfect equation for the band 6 range and an equation which can theoretically be +/- 1 mark for the band 4/5 range. For most other subjects, the equation should theoretically be +/- 1 aligned mark - this is an inherent limit to inaccuracy once two data points are available from the middle/lower end of a band range as any further inaccuracy of band cutoff would result in an inaccurate result for the known data point. Below the available data range, the simple linear equation is theoretically not entirely valid, however given that the vast majority of students sit within or near the known data, its a sufficient compromise, which should provide results within a reasonable range that realistically is good enough given the low changes in scaled marks at low aligned marks, and it allows the calculator to properly function below known data where previous versions would simply output errors - though I still wouldn't advise usage of the calculator for a student with results in bands 1-3. Subjects which are marked with an asterisks required usage of multiple years of data - I resolved to only use this method when the data for one year was extremely scarce and newer data was fairly consistent, enabling a fairly limited degradation of accuracy, which should remain within the standard +/- 1 aligned mark in the vast majority of cases. Subjects which are marked with italics have data from long periods of time and/or a lack data within an included band range, and are only included due to requests - their data will be within +/- 2 aligned marks typically.

The equations found in desmos were then inputted into sheets through a relatively simple =IFS() formula to select the correct linear equation based on which band region a raw mark sits in, in addition to a simple MIN(x,100) function to remove the possibility of marks above 100 (or rather 50 for 1 unit subjects). The sheet uses two cells to store aligned marks, the first, row 3 is a rounded version of the aligned mark for the sake of creating a more simple user interface and preventing confusion given the rounded marks which a student is likely used to. However, row 7 contains the raw mark which is actually used for scaling calculations, as UAC has confirmed they use marks rounded to two decimal places.

The scaled mark conversion doesn't seem to follow any specific equation, rather a more complex system based on the results of each percentile of English marks within a subject, however, UAC provides significantly more data than NESA, and their data is much better separated by percentile cutoffs in the scaling report. From some basic testing using the quadratic line of best fit in Google Sheets, it seems possible to model the scaling process quite effectively using a quadratic equation (example). Hence, I used these simple equations of best fit, and found their accuracy to typically remain within my acceptable standard of +/- 1 mark (with the exception of one Maths Extension 1 data point, which seems to have been a data entry error on UAC's part given that it is a major outlier). However, below the 25th percentile the quadratic equation doesn't function well, so as with the alignment, I used a simple linear equation based on the gradient from 0 to the 25th percentile. For high scaling subjects this will function quite well as their scaling tends to be relatively linear nonetheless, however, for low scaling subjects, there is a sharp difference in gradient which is relatively unavoidable - its unknown how UAC handles marks below the 25th percentile, especially considering that there is such a high gradient for many low scaling subjects at the edge of known data, which must somehow convert to an extremely low gradient in order to prevent negative scaled marks. I may conduct further experimentation in the future with ATAR compass and data points from this forums to more accurately map out marks in this range in the future, but for now this is an acceptable solution, especially considering that by nature it is only necessary for the bottom 25th percentile, who are not typically users of a tool like this.

(1/2)
yo thanks for this a lot! just wondering if you could add visual arts in?
 

Bob99

temporarylol hater
Joined
Apr 1, 2022
Messages
228
Gender
Male
HSC
2022
There's a really common misconception that you can put trial marks straight into ATAR calculators - and if you've done this, you'll probably notice a really disappointing ATAR that's much lower than what you expected. The reason for this is that ATAR calculators are actually designed for 'aligned' marks, which are the marks you get on the actual HSC results day. These aligned marks are entirely different to your raw mark in the exam, which aren't released unless you pay NESA $60 per subject. I won't go into too much detail, but basically alignment usually increases your marks (sometimes by as much as 30 marks!) to match NESA's band descriptors, but the long and short of it is that you can't use normal ATAR calculators with your trial/past paper marks.

To fill this hole I've been developing my own ATAR calculator, which allows you to input your raw marks (could be trial or past paper marks), converts them into aligned marks using extrapolated formulas based on data from https://rawmarks.info, and then follows a similar process to other ATAR calculators to convert your aligned marks into an ATAR.

https://docs.google.com/spreadsheets/d/1btYJ_CrnBl9mKI4jKoRW3ALxHKiKamtU5UtGawmbeOg/edit?usp=sharing
(you'll have to make a copy to use it)

DISCLAIMER: all data is based on the 2022 HSC unless indicated otherwise with italics or an *. If your trial was easier/harder than last years HSC then I obviously can't account for it, but you could just enter in a slightly lower/higher mark if you wish to - The ideal use for this calculator would be actually doing all of the 2022 HSC as past papers for near-perfect accuracy, but obviously that's a bit unrealistic. The other factor is that this is just an estimate of where you're at right now, and obviously things will change (usually for the better) over time, and your moderated School Assessment Mark will also be a factor in your marks.

DISCLAIMER 2: Just to clarify since people have been asking, this calculator will produce different results to UAC ATAR Compass, that is intended behaviour. The calculators are fundamentally different, ATAR Compass is intended for aligned marks while this calculator is for raw marks.

I hope this is helpful :)

Technical details (stop reading here if you're not interested)
Generally, the way the calculator itself functions is fairly simple - it's hidden normally but it's all visible if you open the "advanced" mode. The methodology for development was a little more interesting. This explanation will hopefully be helpful for anyone who wishes to understand how this calculator works or undertake a similar project in the future (and I just want to talk about it).

For the raw->aligned mark conversion, I found through research that the process of alignment is piecewise linear - NESA sets band cutoffs based on the band descriptors, and then linearly interpolates between marks (source here, also confirmed through my own testing). The data from https://rawmarks.info for many subjects includes multiple marks within each band range, and so I plotted all of the 2022 marks for each subject in desmos, and hence found the (linear) equation which resolved correct rounded aligned marks and matched to an exact rounded band cutoff for each available band (example), and then beyond available data I simply used a linear equation from 0 to the lowest known band cutoff. The accuracy of this method is dependent on the subject - for subjects with lots of raw marks data (e.g. most Mathematics subjects, English Advanced, Chemistry, Physics and Economics) this results in a perfect equation for the band 6 range and an equation which can theoretically be +/- 1 mark for the band 4/5 range. For most other subjects, the equation should theoretically be +/- 1 aligned mark - this is an inherent limit to inaccuracy once two data points are available from the middle/lower end of a band range as any further inaccuracy of band cutoff would result in an inaccurate result for the known data point. Below the available data range, the simple linear equation is theoretically not entirely valid, however given that the vast majority of students sit within or near the known data, its a sufficient compromise, which should provide results within a reasonable range that realistically is good enough given the low changes in scaled marks at low aligned marks, and it allows the calculator to properly function below known data where previous versions would simply output errors - though I still wouldn't advise usage of the calculator for a student with results in bands 1-3. Subjects which are marked with an asterisks required usage of multiple years of data - I resolved to only use this method when the data for one year was extremely scarce and newer data was fairly consistent, enabling a fairly limited degradation of accuracy, which should remain within the standard +/- 1 aligned mark in the vast majority of cases. Subjects which are marked with italics have data from long periods of time and/or a lack data within an included band range, and are only included due to requests - their data will be within +/- 2 aligned marks typically.

The equations found in desmos were then inputted into sheets through a relatively simple =IFS() formula to select the correct linear equation based on which band region a raw mark sits in, in addition to a simple MIN(x,100) function to remove the possibility of marks above 100 (or rather 50 for 1 unit subjects). The sheet uses two cells to store aligned marks, the first, row 3 is a rounded version of the aligned mark for the sake of creating a more simple user interface and preventing confusion given the rounded marks which a student is likely used to. However, row 7 contains the raw mark which is actually used for scaling calculations, as UAC has confirmed they use marks rounded to two decimal places.

The scaled mark conversion doesn't seem to follow any specific equation, rather a more complex system based on the results of each percentile of English marks within a subject, however, UAC provides significantly more data than NESA, and their data is much better separated by percentile cutoffs in the scaling report. From some basic testing using the quadratic line of best fit in Google Sheets, it seems possible to model the scaling process quite effectively using a quadratic equation (example). Hence, I used these simple equations of best fit, and found their accuracy to typically remain within my acceptable standard of +/- 1 mark (with the exception of one Maths Extension 1 data point, which seems to have been a data entry error on UAC's part given that it is a major outlier). However, below the 25th percentile the quadratic equation doesn't function well, so as with the alignment, I used a simple linear equation based on the gradient from 0 to the 25th percentile. For high scaling subjects this will function quite well as their scaling tends to be relatively linear nonetheless, however, for low scaling subjects, there is a sharp difference in gradient which is relatively unavoidable - its unknown how UAC handles marks below the 25th percentile, especially considering that there is such a high gradient for many low scaling subjects at the edge of known data, which must somehow convert to an extremely low gradient in order to prevent negative scaled marks. I may conduct further experimentation in the future with ATAR compass and data points from this forums to more accurately map out marks in this range in the future, but for now this is an acceptable solution, especially considering that by nature it is only necessary for the bottom 25th percentile, who are not typically users of a tool like this.

(1/2)
damn i put in my hsc marks and i wish i got the atar it estimated 😢
 

Rattlehead15

Well-Known Member
Joined
Jun 30, 2022
Messages
369
Location
Alice Springs
Gender
Undisclosed
HSC
2023
you’re not the only one, the number of people who’ve told me “omg this is so inaccurate and overestimating” without actually reading the post or understanding how it works is kinda insane 😭
Dw carrots, you're just on a higher plane of intellectual being for the common board of studies user to understand
 

Charlie129

Member
Joined
Jun 30, 2022
Messages
34
Gender
Male
HSC
2023
No Design and Tech? Can't really blame you since raw marks database only has 2 years of data :( and both a similar range
 

Users Who Are Viewing This Thread (Users: 0, Guests: 4)

Top