• Current students
      • Student centre
        Enrol on a course/exam
        My enrolments
        Exam results
        Mock exams
        Learning Hub data privacy policy
      • Course information
        Students FAQs
        Student induction
        Course enrolment information
        Key dates
        Book distribution
        Timetables
        FAE Elective Information
      • Exams
        Exam Info: CAP1
        E-assessment information
        Exam info: CAP2
        Exam info: FAE
        Reasonable accommodation and extenuating circumstances
        Timetables for exams & interim assessments
        Interim assessments past papers & E-Assessment mock solutions
        Main examination past papers
        Information and appeals scheme
        JIEB: NI Insolvency Qualification
      • CA Diary resources
        Mentors: Getting started on the CA Diary
        CA Diary for Flexible Route FAQs
      • Admission to membership
        Joining as a reciprocal member
        Conferring dates
        Admissions FAQs
      • Support & services
        Recruitment to and transferring of training contracts
        CASSI
        Student supports and wellbeing
        Audit qualification
        Diversity and Inclusion Committee
    • Students

      View all the services available for students of the Institute

      Read More
  • Becoming a student
      • About Chartered Accountancy
        The Chartered difference
        What do Chartered Accountants do?
        5 Reasons to become a Chartered Accountant
        Student benefits
        School Bootcamp
        Third Level Hub
        Study in Northern Ireland
        Events
        Blogs
        Member testimonials 2022
        Become a Chartered Accountant podcast series
      • Entry routes
        College
        Working
        Accounting Technicians
        School leavers
        Member of another body
        International student
        Flexible Route
        Training Contract
      • Course description
        CAP1
        CAP2
        FAE
        Our education offering
      • Apply
        How to apply
        Exemptions guide
        Fees & payment options
        External students
      • Training vacancies
        Training vacancies search
        Training firms list
        Large training firms
        Milkround
        Training firms update details
        Recruitment to and transferring of training contract
        Interview preparation and advice
        The rewards on qualification
        Tailoring your CV for each application
        Securing a trainee Chartered Accountant role
      • Support & services
        Becoming a student FAQs
        Who to contact for employers
        Register for a school visit
    • Becoming a
      student

      Study with us

      Read More
  • Members
      • Members Hub
        My account
        Member subscriptions
        Annual returns
        Application forms
        CPD/events
        Member services A-Z
        District societies
        Professional Standards
        Young Professionals
        Careers development
        Diversity and Inclusion Committee
      • Members in practice
        Going into practice
        Managing your practice FAQs
        Practice compliance FAQs
        Toolkits and resources
        Audit FAQs
        Other client services
        Practice Consulting services
        What's new
      • Overseas members
        Working abroad
        Working in Australia
        Overseas members news
        Tax for returning Irish members
      • In business
        Networking and special interest groups
        Articles
      • Public sector
        Public sector news
        Public sector presentations
      • Support & services
        Letters of good standing form
        Member FAQs
        AML confidential disclosure form
        CHARIOT/Institute Technical content
        TaxSource Total
        The educational requirements for the audit qualification
        Pocket diaries
        Thrive Hub
    • Members

      View member services

      Read More
  • Employers
      • Training organisations
        Authorise to train
        Training in business
        Manage my students
        Incentive Scheme
        Recruitment to and transferring of training contracts
        Securing and retaining the best talent
        Tips on writing a job specification
      • Training
        In-house training
        Training tickets
      • Recruitment services
        Hire a qualified Chartered Accountant
        Hire a trainee student
      • Non executive directors recruitment service
      • Support & services
        Hire members: log a job vacancy
        Firm/employers FAQs
        Training ticket FAQs
        Authorisations
        Hire a room
        Who to contact for employers
    • Employers

      Services to support your business

      Read More
☰
  • The Institute
☰
  • Home
  • Articles
  • Students
  • Advertise
  • Subscribe
  • Archive
  • Podcasts
  • Contact us
Search
View Cart 0 Item
  • Home/
  • Accountancy Ireland/
  • Articles/
  • Careers/
  • Latest News

Careers

Reinforcement learning

Feb 05, 2018
What can AlphaGo-Zero’s reinforcement learning process teach teams about their actions?

Towards the end of 2017, we were introduced to AlphaGo-Zero, arguably the strongest player (albeit a virtual one) in the history of the ancient Chinese game of Go. Previous versions of AlphaGo learned to play by absorbing details of thousands of human amateur and professional games, but AlphaGo Zero learned to play simply by playing games against itself, starting from completely random play. It learned to play a game in which there are more theoretical moves available to players than atoms in the known universe by learning solely from itself. After 40 days of self-learning, AlphaGo-Zero defeated the previous human-champion-defeating version of AlphaGo by 100 games to 0. And on day 21, it reached the standard of the top human players of all time by achieving ‘grand master’ status.

In artificial intelligence (AI) terms, this is called ‘reinforcement learning’ – a learning process in which “the learning agent has to decide how to act to perform its task. In the (intentional) absence of existing training data, the agent learns from experience. It collects the training examples (this action was good, that action was bad) through trial and error as it attempts its task, with the goal of maximising long-term reward”, according to Vishal Maini, a machine-learning expert.

Creating its own effectiveness

One of the big lessons we have learned from Simon Sinek, author of Start With Why and Leaders Eat Last, is that, when we are making a presentation, delivering a training intervention or engaging in any activity where we need the buy-in of others, we need to start with why. Indeed, the idea of AlphaGo-Zero ‘creating its own effectiveness’ provides a compelling ‘why’ when discussing team coaching programmes and interventions.

Why do we coach teams? Because it puts learning from experience, with the goal of maximising long-term performance levels, right at the centre of the team’s agenda. It challenges the team to decide how to act in order to best achieve its commissioned purpose. It empowers the team to create its own effectiveness.

As Peter Senge points out in his excellent book, The Fifth Discipline, “we learn best from experience, but (sometimes) we never directly experience the consequences of our most important decisions” as often the primary consequences of our actions occur, as he puts it, “somewhere in the distant future or in a distant part of the larger system within which we operate”. In unpacking this thought, Senge introduces the concept of learning horizons, which he describes as the “breadth of vision in time and space within which we assess our effectiveness”. He claims that “when our actions have consequences beyond our learning horizon, it becomes impossible to learn from direct experience”.

The role of reflection

However, a growing body of evidence suggests that learning horizons can be significantly expanded through strong, reflective practice and it is in this area that team coaching programmes and interventions can be successfully utilised. Kouzes and Posner in The Leadership Challenge demonstrate that regular exploration of the question “what can we learn from this?” can triple team effectiveness when compared to those who rarely or never reflect in this way.

Thus, not only does “what can we learn from what just happened?” become a stock question in a team coach’s tool-kit, but it is frequently expanded into the wonderful question, “what can we learn from what just happened that will improve life for your customers’ customers?” This creates an impetus within the team towards thinking more systemically; towards locating itself within its wider organisational context, which in turn delivers much greater “breadth of vision in time and space”. In so doing, the coach is empowering the team to more deeply develop the capacity to take perspectives, view authority in new ways, and see shades of grey where they once saw only black and white.

Complexity, ambiguity and change

Today’s organisations want their workforce to handle complexity, ambiguity and all the accompanying stress involved in working in an environment of constant change. However, coping well with such issues is not simply a skill anyone can acquire, but more a way of living in the world.

Robert Kegan, author of The Evolving Self, calls this process one of “meaning making’ which, if you think about it, is pretty much the challenge that our artificially intelligent chum AlphaGo-Zero was faced with when presented, sans instructions, with a 2,500 year old Chinese game that offered more potential moves than the number of atoms in the known universe!

Volatility, uncertainty, complexity and ambiguity have become ubiquitous working companions. They are ever-present members of every team in every organisation, presenting us all with challenges that even five years ago would not have seemed feasible. It is within that working environment that our teams have to begin to, as the psychologists put it, “construe, understand, or make sense of events, relationships and the self.”

New terrain

This is new terrain for most and while many teams may not be presented with quite as many move options in each new situation as there are atoms in the known universe, there are a lot more options than there used to be. Teams therefore need to inculcate this ability to constantly self-teach into their DNA by “retaining, reaffirming, revising, or replacing elements of their orienting system to develop more nuanced, complex and useful systems”, as noted by James Gillies in The Meaning of Loss.

To do that, they need to learn to reflect, as Peter Hawkins puts it in Leadership Team Coaching, “on their own performance and multiple processes, and consolidate their learning ready for the next cycle of development: for themselves, for their wider systems and to create positive consequences somewhere in the distant future”.
And perhaps the most compelling “why” of team coaching is to empower them to do just that.

Ian Mitchell and Sian Lumsden are co-founders of Eighty20 Focus, a real-time executive coaching organisation.

The latest news to your inbox

Useful links

  • Current students
  • Becoming a student
  • Knowledge centre
  • Shop
  • District societies

Get in touch

Dublin HQ

Chartered Accountants
House, 47-49 Pearse St,
Dublin 2, Ireland

TEL: +353 1 637 7200
Belfast HQ

The Linenhall
32-38 Linenhall Street, Belfast
Antrim BT2 8BG, United Kingdom.

TEL: +44 28 9043 5840

Connect with us

CAW Footer Logo-min
GAA Footer Logo-min
CARB Footer Logo-min
CCAB-I Footer Logo-min

© Copyright Chartered Accountants Ireland 2020. All Rights Reserved.

☰
  • Terms & conditions
  • Privacy statement
  • Event privacy notice
LOADING...

Please wait while the page loads.