Straightforward data science intro

This looks to be an interesting response to the call to be more data literate. Via Flowing Data, a straightforward and potentially free way to get skilled up with R, without needing to install any software, it seems.

Chromebook Data Science – a free online data science program for anyone with a web browser
The reason they are called Chromebook Data Science is because philosophically our goal was that anyone with a Chromebook could do the courses. All you need is a web browser and an internet connection. The courses all take advantage of RStudio Cloud so that all course work can be completed entirely in a web browser. No need to install software or have the latest MacBook Computer.

Here’s some info on what the courses cover, including introductions to R and GitHub. Worth a look?

Excel’s getting interesting. No, really

News that Excel will soon be expanding its range of data types, enabling a much richer and more dynamic experience.

Excel Data Types
AI powered Excel Data Types will transform the way we work with Excel by enabling a cell to contain much more than text, numbers or formulas.

There are currently two Excel data types available to Office 365 users; Stocks and Geography. Let’s start with the Geography Data Type that can take a table of countries and return rich data that can be referenced in Excel formulas and expand into further columns.

excel-getting-interesting-2

Mynda takes us through many other examples of how these new data types can be used and referenced in our spreadsheets. And it seems like this is just the beginning.

The Excel team have big plans for Data Types with more coming, including the ability to create your own data types unique to your organisation. Imagine data types for Employees, Products, Stores, Regions… the list is endless.

“Reduced”?

Have I just been insulted by the head of Ofsted?

Spielman: Teachers ‘reduced to data managers’
Teachers have been reduced to “data managers” instead of “experts in their field”, Ofsted chief inspector Amanda Spielman will argue today. “I don’t know a single teacher who went into teaching to get the perfect Progress 8 score (a measure of pupil progress),” she will tell a schools conference in Newcastle this morning.

I see her point, though that’s unfortunate paraphrasing from TES. The Guardian’s version, after the speech actually took place, also has the line ‘reduced teachers to the status of “data managers”.’

Here’s the full, less patronising, quote, with no reduction to be seen:

Amanda Spielman speech to the SCHOOLS NorthEast summit
The bottom line is that we must make sure that we, as an inspectorate, complement rather than intensify performance data, because our curriculum research and a vast amount of sector feedback have told us that a focus on performance data is coming at the expense of what is taught in schools.

A new focus on substance should change that, bringing the inspection conversation back to the substance of young people’s learning and treating teachers like the experts in their field, not just data managers. I don’t know a single teacher who went into teaching to get the perfect progress 8 score. They go into it because they love what they teach and want children to love it too. That is where the inspection conversation should start and with the new framework, we have an opportunity to do just that.

Follow the data

I’m hearing more and more about data ethics. It wasn’t ‘a thing’ before, was it? But it certainly is now. Here’s a very interesting take on it: flow.

The ethics of data flow
In Privacy in Context, Helen Nissenbaum connects data’s mobility to privacy and ethics. For Nissenbaum, the important issue isn’t what data should be private or public, but how data and information flow: what happens to your data, and how it is used. Information flows are central to our expectations of privacy, and respecting those expectations is at the heart of data ethics.

It’s not what they’ve got, but what they do with it that matters.

The infamous Target case, in which Target outed a pregnant teenager by sending ad circulars to her home, is a great example. We all buy things, and when we buy things, we know that data is used—to send bills and to manage inventory, if nothing else. In this case, the surprise was that Target used this customer’s purchase history to identify her as pregnant, and send circulars advertising products for pregnant women and new mothers to her house. The problem isn’t the collection of data, or even its use; the problem is that the advertising comes from, and produces, a different and unexpected data flow. The data that’s flowing isn’t just the feed to the marketing contractor. That ad circular, pushed into a mailbox (and read by the girl’s father) is another data flow, and one that’s not expected.

[…]

Everyone who works with data knows that data becomes much more powerful when it is combined with data from other sources. Data that seems innocuous, like a grocery store purchase history, can be combined with geographic data, medical data, and other kinds of data to characterize users and their behavior with great precision. Knowing whether a person purchases cigarettes can be of great interest to an insurance company, as can knowing whether a cardiac patient is buying bacon.

The article is written by and for data developers, primarily, and poses more questions than it can answer, especially around the thorny concept of data deletion. It’s an interesting read, but it left me wondering if those GDPR data protection principles will ever be fully put into practice.

Happy statistics day

It’s GCSE results day and, despite the new grading system, the news people are bringing out updated versions of their usual it’s-getting-better-it’s-getting-worse stories.

GCSE results day 2018: New ‘tougher’ exams favour boys as gender gap narrowest in seven years
Girls remain in the lead, with 23.4 per cent achieving one of the highest grades, which is the same as last year, compared to 17.1 per cent of boys, up from 16.2 per cent last year. But the gap in top grades between boy and girls is now at its narrowest since 2010, with boys just 6.3 per cent behind girls, down from 7.2 per cent last year.

GCSE results rise despite tougher exams
A total of 20 of the most popular GCSE subjects in England have been graded for the first time in the numerical format – plus English and maths, which were introduced in the new format last year. These include history, geography, sciences and modern languages, all of which have been designed to be more difficult.

Of those achieving all grade 9s – and taking at least seven of the new GCSEs – almost two-thirds were girls.

GCSEs: boys close gap on girls after exams overhaul
Boys appear to have been the major beneficiary of the overhaul of GCSE examinations taken in England for the first time this summer, as results showed across-the-board improvements in boys gaining top marks while girls saw their share of top grades dip.

Across the UK the proportion of students gaining an A or 7 and above, the new top grade used in England, rose above 20%, with boys in England closing the gap on girls with an almost one percentage point rise to 17.1% with girls unchanged at 23.4%.

In the reformed GCSEs in England, 4.3% of the results were the new highest 9 grade, set at a higher mark than the previous A* grade. The figures on Thursday showed 732 students attained seven or more grade 9s.

Despite the improvements by boys in England they were still outperformed by girls at the highest level: 5% of entries by girls received 9s, compared with just 3.6% of boys.

GCSE pass rate goes UP – but fewer students get new top ‘9’ grade compared to old A* mark
The overall pass rate – the percentage of students getting a 4 or above or a C or above – was 66.9 per cent, compared with 66.4 per cent last year.

But just 4.3 per cent of exams were given the new 9 grade, which was brought in to reward the absolute highest achievers. Just 732 students in England got a clean sweep of seven or more grade 9s.

Previously around seven per cent of exams scored the top A* grade.

The more detail-oriented education sector websites are worth a read, if you really want to dig down into all this.

GCSE results 2018: How many grade 9s were awarded in the newly reformed subjects?
There has been a curious amount of interest in how many students might achieve straight 9s in all subjects. It seems to have started with a throwaway remark on twitter by the then-chief scientific adviser at the Department for Education that only two students would do so. Tom Benton from Cambridge Assessment then produced some excellent research showing that it would, in fact, be several hundred.

Today Ofqual has answered the question once and for all. A total of 732 students who took at least seven reformed GCSEs achieved grade 9 in all of them. Given that fewer grade 9s are awarded than grade A*, it should come as no surprise that fewer students will achieve straight grade 9s compared to straight grade A*s.

But it can get a little heavy-going at times.

GCSE results day 2018: The main trends in entries and grades
Across all subjects, 21.5% of entries were awarded a grade 7/A or above, compared to 21.1% last year. At grade 4/C or above, 69.3% of entries achieved the standard this year, compared to 68.9% last year. Both figures have been on something of a downward trend since 2015, so this year’s figures arrest this decline.

GCSE and A-Level results analysis
Explore trends in national entry and attainment data between 2014 and 2018 in:
All subjects
Additional mathematics
Additional science
Art and design subjects
Biology
Business and communication systems
Business studies
Chemistry
Citizenship studies

GCSE 2018 variability charts: Are your results normal?
Each year Ofqual produces boring-sounding variability charts. It sounds dull but they show how many centres, i.e. schools or colleges, dropped or increased their results compared with the previous year. This means that if you dropped, say, 25 per cent in one subject, you can see how many other schools also saw the same dip.

Let’s give the last word to the JCQ and Ofqual, and have done with it: I’m getting a headache.

JCQ Joint Council for Qualifications: Examination results
Each year, JCQCIC collates the collective results for its members from more than 26 million scripts and items of coursework. We only publish collated results from our members though and cannot supply regional, centre or candidate information.

Ofqual Analytics
Ofqual analytics presents a selection of data in an engaging and accessible way by using interactive visualisations. We hope this innovative approach to presenting data will make it easier to understand and explore the data we produce.

Map of GCSE (9 to 1) grade outcomes by county in England
The map shows reformed GCSE full course results (the percentage of students achieving specific grades) in England by subject and county for the summer 2018 examination series as well as the summer 2017 examination series. Data in the map represents the results that were issued on results day for both years (23 August 2018 and 24 August 2017) and do not reflect any changes following post-results services.

happy-statistics-day-2

(All I know is that my boy got his GCSE results today too, and they’re a credit to the amount of time and effort he’s put in over the years.)

AI to the rescue

In 2016 the RNIB announced a project between the NHS and DeepMind, Google’s artificial intelligence company.

Artificial intelligence to look for early signs of eye conditions humans might miss
With the number of people affected by sight loss in the UK predicted to double by 2050, Moorfields Eye Hospital NHS Foundation Trust and DeepMind Health have joined forces to explore how new technologies can help medical research into eye diseases.

This wasn’t the only collaboration with the NHS that Google was involved in. There was another project, to help staff monitor patients with kidney disease, that had people concerned about the amount of the medical information being handed over.

Revealed: Google AI has access to huge haul of NHS patient data
Google says that since there is no separate dataset for people with kidney conditions, it needs access to all of the data in order to run Streams effectively. In a statement, the Royal Free NHS Trust says that it “provides DeepMind with NHS patient data in accordance with strict information governance rules and for the purpose of direct clinical care only.”

Still, some are likely to be concerned by the amount of information being made available to Google. It includes logs of day-to-day hospital activity, such as records of the location and status of patients – as well as who visits them and when. The hospitals will also share the results of certain pathology and radiology tests.

The Google-owned company tried to reassure us that everything was being done appropriately, that all those medical records would be safe with them.

DeepMind hits back at criticism of its NHS data-sharing deal
DeepMind co-founder Mustafa Suleyman has said negative headlines surrounding his company’s data-sharing deal with the NHS are being “driven by a group with a particular view to peddle”. […]

All the data shared with DeepMind will be encrypted and parent company Google will not have access to it. Suleyman said the company was holding itself to “an unprecedented level of oversight”.

That didn’t seem to cut it though.

DeepMind’s data deal with the NHS broke privacy law
“The Royal Free did not have a valid basis for satisfying the common law duty of confidence and therefore the processing of that data breached that duty,” the ICO said in its letter to the Royal Free NHS Trust. “In this light, the processing was not lawful under the Act.” […]

“The Commission is not persuaded that it was necessary and proportionate to process 1.6 million partial patient records in order to test the clinical safety of the application. The processing of these records was, in the Commissioner’s view, excessive,” the ICO said.

And now here we are, some years later, and that eye project is a big hit.

Artificial intelligence equal to experts in detecting eye diseases
The breakthrough research, published online by Nature Medicine, describes how machine-learning technology has been successfully trained on thousands of historic de-personalised eye scans to identify features of eye disease and recommend how patients should be referred for care.

Researchers hope the technology could one day transform the way professionals carry out eye tests, allowing them to spot conditions earlier and prioritise patients with the most serious eye diseases before irreversible damage sets in.

That’s from UCL, one of the project’s partners. I like the use of the phrase ‘historic de-personalised eye scans’. And it doesn’t mention Google once.

Other reports also now seem to be pushing the ‘AI will rescue us’ angle, rather than the previous ‘Google will misuse our data’ line.

DeepMind AI matches health experts at spotting eye diseases
DeepMind’s ultimate aim is to develop and implement a system that can assist the UK’s National Health Service with its ever-growing workload. Accurate AI judgements would lead to faster diagnoses and, in theory, treatment that could save patients’ vision.

Artificial intelligence ‘did not miss a single urgent case’
He told the BBC: “I think this will make most eye specialists gasp because we have shown this algorithm is as good as the world’s leading experts in interpreting these scans.” […]

He said: “Every eye doctor has seen patients go blind due to delays in referral; AI should help us to flag those urgent cases and get them treated early.”

And it seems AI can help with the really tricky problems too.

This robot uses AI to find Waldo, thereby ruining Where’s Waldo
To me, this is like the equivalent of cheating on your math homework by looking for the answers at the back of your textbook. Or worse, like getting a hand-me-down copy of Where’s Waldo and when you open the book, you find that your older cousin has already circled the Waldos in red marker. It’s about the journey, not the destination — the process of methodically scanning pages with your eyes is entirely lost! But of course, no one is actually going to use this robot to take the fun out of Where’s Waldo, it’s just a demonstration of what AutoML can do.

There’s Waldo is a robot that finds Waldo

We all need to be data literate

This article from Harvard Business Review doesn’t mention schools once, but I think it fits perfectly well in that setting.

The democratization of data science
Intelligent people find new uses for data science every day. Still, despite the explosion of interest in the data collected by just about every sector of American business — from financial companies and health care firms to management consultancies and the government — many organizations continue to relegate data-science knowledge to a small number of employees.

That’s a mistake — and in the long run, it’s unsustainable.

It goes on to outline the three steps necessary to create a more data literate organisation; share data tools, spread data skills, and spread data responsibility. Couldn’t agree more. It’s well worth a read.