AI Body of Knowledge’s cover photo
AI Body of Knowledge

AI Body of Knowledge

Professional Training and Coaching

Sydney, NSW 90 followers

Empowering business professionals and teams to build practical, sustainable GenAI skills that work in real environments.

About us

The AI Body of Knowledge (AIBoK) helps knowledge workers, IT professionals, executives, and board members move beyond AI curiosity to real capability with practical, sustainable GenAI skills that work in real environments.

Website
https://www.aibok.org
Industry
Professional Training and Coaching
Company size
2-10 employees
Headquarters
Sydney, NSW
Type
Privately Held
Founded
2025
Specialties
Artificial Intelligence, AI, Large Language Models, Training, and Professional Development

Locations

Updates

  • Job titles are often one of the clearest signals that AI adoption is moving from experimentation into everyday work. The important shift is not simply giving people access to AI tools. It is building the capability to use those tools safely, practically and in ways that make sense for the specific work being done. That is where organisations will start to see real value.

    Newsroom jobs are sprouting up that didn't exist 5 years ago. "Senior Editor, AI Innovation" "Editorial Director, Newsroom Engineering" Researchers combed through 6,687 LinkedIn job listings at major news organizations. They classified 234 as strategy roles. Then they narrowed it down to 16 jobs they say define the future newsroom. Politico described its Editorial Director, Newsroom Engineering role as "a player-coach who turns newsroom priorities into tools, workflows, and platforms that help reporters and editors move faster without sacrificing accuracy or voice." A player-coach. Fluent in editorial and in engineering. That role did not exist five years ago. Now it is a hiring priority at one of the most competitive newsrooms in the country. The Economist is recruiting a Senior AI Engineer for its AI Lab and listing "fine-tuning models for style or persona" as a valued skill. A news organization is hiring someone to teach AI how to sound like itself. This is not a story about AI replacing journalists. It is a story about journalism absorbing AI into its DNA and building the organizational muscle to do it well. NK POV: Job titles are a leading indicator. They tell you what an organization actually believes, not just what it says at conferences. The fact that "newsroom engineering" is now a senior editorial function at Politico signals something real: the best publishers have stopped treating technology as an IT problem and started treating it as a journalism problem. The organizations getting this right are producing better, more distinctive work. And the ones still debating whether AI belongs in the newsroom at all are falling further behind every quarter. The future is being built right now, one job description at a time. Link to job posts in the comments.

  • The adoption gap is becoming clearer: organisations do not just need access to AI tools. They need practical capability, governance, workflow understanding and safe-use habits.

    Last week I had the chance to hear the AI adoption conversation from several very different rooms.  Amazon Web Services (AWS) Summit Sydney gave the cloud, data, governance and agentic AI view. Leader EXPO gave the partner, MSP and business adoption view. BNI Quantum Sydney Sydney South Business Breakfast gave the local business-owner and SME view. These followed the Microsoft AI Tour Sydney two weeks ago, which gave the enterprise and platform view. Different rooms. Different language. Different priorities. But the same adoption gap kept showing up; the market is moving beyond “AI is exciting” and into the harder work of adoption. That is where the practical questions start: 1). What should we actually use AI for? 2). Which workflows should change? 3). What data is safe? 4). What guardrails are needed? 5). What should staff avoid? 6). Where does human judgement still matter? People are asking: “How do we move from scattered experimentation to safe, useful and repeatable workplace practice?” For me, that was the clearest signal from the week. AI adoption is not just a technology problem; it is a capability, governance, workflow and business-outcomes problem.That is the gap the AI Body of Knowledge is focused on helping organisations close.

    • AWS Summit Sydney event floor with attendees beneath a large AWS sign. The image highlights cloud, data, governance, agentic AI and the AI adoption gap.
    • Microsoft AI Tour Sydney event floor with attendees and a large digital display. Text overlay reads: “Microsoft AI Tour Sydney. Enterprise, platforms and practical adoption. Different rooms. Same adoption gap.”
    • Leader EXPO 2026 session with a Microsoft speaker presenting to an audience. Text overlay reads: “Leader EXPO 2026. Partner, MSP and business adoption view. Different rooms. Same adoption gap.”
    • BNI Quantum Sydney South Business Breakfast with a speaker presenting to local business owners and attendees. Text overlay reads: “BNI Quantum Breakfast. Local business-owner and SME view. Different rooms. Same adoption gap.”
  • A strong takeaway from the recent Microsoft AI Tour event in Sydney was that most organisations are now past the awareness stage with AI. The real challenge is helping teams use it safely, practically, and consistently in day-to-day work. That gap between experimentation and operational habit is where a lot of the real work still needs to happen.

    A practical observation from last week's Microsoft AI Tour Sydney: The things that seemed to resonate most were simple: Real examples Hands-on sessions Practical use cases tied to the work people already do Not abstract AI theory. Not tool demos for their own sake. The opportunity now is helping teams move from interest and experimentation to safe, useful, consistent use.

    • AI capability framework showing that mindset and skills matter as much as tools
  • Access to AI tools is increasing quickly. There are some fantasic tools readily available now for users, with new products and features launching every week. Capability is now the differentiator. Professionals and teams that learn how to use AI safely, usefully, and consistently will move faster than those still experimenting.

    I spent yesterday at the Microsoft AI Tour Sydney. There was a strong focus on agents, automation, Copilot, and AI-first organisations. The strongest theme for me was not just the technology itself, but how organisations are recognising the need to turn it into practical, trusted use in real work. Most teams are no longer asking whether AI matters. They are asking: - where do we start? - how do we use it safely? - what should we use it for? - how do we make it stick? That feels like where a lot of the real work still needs to happen. The big takeaway for me was not just the exciting technology or emerging products. It was clear that the conversation had moved on. Organisations are no longer watching AI from the sidelines. They are starting to ask the right questions about capability, trust, and real-world use.

    • Microsoft AI Tour Sydney expo floor with attendees and booths
    • Crowd at Microsoft AI Tour Sydney event in ICC Sydney
  • A helpful lens: the biggest value from GenAI is not content creation. It’s helping teams make sense of complexity, faster. Most organisations have a “meaning problem” before they have a “writing problem”: too many documents, too many tools, too many messages, and not enough clarity. The advantage comes when people build the capability to: - ask better questions - check the output - track changes over time - make decisions with evidence Consider this: where would turning noise into signal make the biggest difference in your organisation?

    THE ECONOMICS OF MEANING: FROM VSICALC TO SEMANTIC TRUTH The spreadsheet was a revolution because it dropped the cost of deterministic calculation to zero. You no longer needed a room full of accountants to sum a column; you just needed `=SUM(A1:A5)`. GenAI is the next phase: it drops the cost of probabilistic cognition to zero. It takes the high-entropy "mud" of our digital lives - thousands of messy emails, logs, and documents - and synthesises semantic truth on demand. Cell data -> Unstructured data Formula -> Prompt Calculation -> Inference Balance sheet -> Synthesized view What-if analysis -> "What if..." The irony is that we are still stuck in the "VisiCalc Era" of 1979. We have been given an engine capable of modelling complex reality, and mostly we are just using it to write emails slightly faster. We have essentially been handed a warp drive, and we are currently using it to run errands to the corner shop slightly faster.

    • We have essentially been handed a warp drive, and we are currently using it to run errands to the corner shop slightly faster. https://www.aibok.org/. https://www.linkedin.com/in/phamsi/.
  • GenerativeAI is not just for producing more content. Even greater value is unlocked when it helps teams check work, reduce risk, and catch issues early. Examples include: • spotting inconsistencies across documents • flagging unusual activity in systems and logs • highlighting where outcomes are drifting over time If GenAI is the new spreadsheet, the edge comes from better decisions, not faster typing.

    The industry is currently obsessed with (Writing/Generation): "Write me an email," "Write me a function," "Generate an image." This is low-value because the marginal cost of generation is zero. The true value is in (Auditing/Detection): "Read 1,000 emails and tell me who is lying," "Scan 10,000 lines of code and tell me where the logic drifts," "Watch the logs and tell me when the system violates physics." Generation creates noise. Detection creates signal. If GenAI is the "New Spreadsheet," the trillions of dollars won't be made by people using it to type faster; they will be made by people using it to calculate (reason/audit) deeper.

    • No alternative text description for this image
  • 🎄 Day 12/12: Qwen Vision-Language (Qwen2.5-VL → Qwen3-VL) The 2026 Gift: Vision-language moves from demos to real work 🎁 A “quick” task lands: pull key fields from 30 invoices. Half are PDFs, some are phone photos, one is a screenshot, and the totals do not match. Qwen vision-language models read images, video, and text together and reply in plain language, including extracted fields and the on-page evidence they came from. What makes it different in late 2025: ✅ Evidence-first outputs: it can anchor answers to a specific region of the page, so you can check the source. ✅ Better with awkward shapes: dynamic resolution helps with long receipts and ultra-wide diagrams without squashing the layout. ✅ Clear progression: Qwen3-VL is the flagship today, including “Thinking” variants that take an extra reasoning step before answering, which helps reduce confident-but-wrong numbers. When to use it: 💼 Executives: “Find the risk clause in this scanned 50-page vendor contract and show the evidence.” 💼 Knowledge workers: Turn stacks of PDFs and mobile photos into structured fields and follow-up actions. 💼 IT professionals: Build screen-aware automations for legacy systems that do not have an API. When not to use it: ⚠️ Pure text work: Claude or DeepSeek-V3 are safer defaults. ⚠️ Ultra-high volume extraction: Gemini Flash is often cheaper for simple jobs. ⚠️ Zero-engineering needs: use the multimodal features already included in Copilot or Google Workspace. The capability gap: The gap isn’t knowing multimodal AI exists; it’s knowing when your default text model is flying blind because it cannot see the evidence. Following the AI Body of Knowledge "12 Days of AI Models" series? Full guide will be published in January. Follow AI Body of Knowledge to get updates or DM us for details. 🎯 Building internal capability? DM us. We deliver Generative AI training that focuses on #CapabilityOverHype #12DaysOfAIModels #CapabilityOverHype #AIBoK2025

    • AIBoK ‘12 Days of GenAI Models’ series graphic with the subtitle ‘Daily posts, practical, vendor-neutral’, and the AIBoK logo in the bottom right.
  • 🎄 Day 11/12, Sora 2 You need a 10-15 second “what we shipped” clip for tomorrow’s board pack. You have the message. You do not have a production crew, voice talent, or a spare day. Sora 2 is OpenAI's model that turns a prompt (or a reference image) into a short video clip with synced audio. What makes it different in 2025 ✅  Video plus audio in one pass ✅  More consistent multi-shot scenes across cuts ✅  Storyboards and reusable “characters” built into the workflow When to use it 💼  Executives, a 15-second strategy snapshot for an all-hands update 💼  Knowledge workers, generate three campaign openers, pick one, then polish for review 💼  IT professionals, internal training clips that show the workflow in context When not to use it ⚠️ You need brand-perfect output, draft your stills in FLUX, then animate in your usual stack ⚠️ You only need the script and shot list, use Claude first ⚠️ If Sora 2 is not available in your country yet, use Sora 1 for silent drafts; add voice with ElevenLabs We’ve seen product marketing teams draft multiple alternate product-video openers in an afternoon, then hand the preferred concept to a designer/editor for final polish. The gap now isn’t knowing Sora exists; it’s knowing when video generation is the right lever, and when it is not. 📥 Following the AI Body of Knowledge "12 Days of AI Models" series? Full guide will be published in January. Follow AI Body of Knowledge to get updates or DM us for details. 🎯 Building internal capability? DM us. We deliver training Generative AI that covers model selection, TCO analysis, and safe deployment. #12DaysOfAIModels #CapabilityOverHype #AIBoK2025

    • AIBoK ‘12 Days of GenAI Models’ series graphic with the subtitle ‘Daily posts, practical, vendor-neutral’, and the AIBoK logo in the bottom right.
  • 🎄 Day 10/12: ElevenLabs (model family) Your team needs a 6-minute training video updated by Friday. The script changes twice, legal wants a new disclaimer, and someone asks “can we ship this in Spanish too?” You can almost hear the studio invoice printing. ElevenLabs is a voice generation platform that turns text into natural-sounding speech (text-to-speech) for voiceovers, in-product features like walkthroughs, and multilingual dubbing. What makes it different in 2025: * Voice cloning is now production-ready; you can create consistent voice from a small sample * Dubbing at scale; translate and re-voice content while keeping speaker identity and timing across dozens of languages * Real-time voice is viable; latency low enough for actual product experiences, not just demos When to use it: ✅ Executives: Record a CEO update once; dub it for global teams without re-recording ✅ Knowledge workers: Turn policy and enablement documents into audio summaries people will actually consume ✅ IT professionals: Add voice to apps, kiosks, or support flows where response time matters When not to use it: ❌ If you just need basic text-to-speech inside your cloud stack; Google or Microsoft text-to-speech can be simpler for procurement ❌ If you need speech-to-text transcription; use a dedicated speech-to-text model (this is text-to-speech) ❌ If your risk posture cannot support cloning or synthetic voices yet; start with human narration and tight governance We’re seeing product and enablement teams use high-quality text-to-speech tools to generate narration for ‘how to’ videos from the same source script as the written documentation. It keeps updates in sync, and makes refreshes easy as product features change. The gap isn’t knowing voice exists; it’s knowing when voice becomes a workflow lever, and how to ship it safely. Following the series? Full guide in January. Follow AI Body of Knowledge for updates. Building internal capability? DM us, we deliver training on model selection, evaluation, and safe use. #12DaysOfAIModels #CapabilityOverHype #AIBoK2025

    • 12 Days of GenAI Models. Daily posts, practical, vendor-neutral. Follow AI Body of Knowledge for the series. AIBoK logo and website.
    • Screenshot of the ElevenLabs website showing the text-to-speech interface with a story prompt, tone tags like ‘sarcastically’ and ‘whispers’, and tabs for text to speech, speech to text, dubbing, and voice cloning.
  • 🎄 Day 9/12: FLUX.1 and FLUX.2 (families) Marketing needs 15 creatives today. You write a precise prompt, hit generate, and get something that is almost right. The “close enough” misses are the time sink, especially when the label text is wrong and you are on revision five. That’s the reputation FLUX has built with power users in side-by-side tests: 1) Prompt adherence: it tends to do what you asked, with fewer near-misses. 2) Text and “hard prompts”: signs, labels, packaging, UI style assets, especially in the newer FLUX.2 tiers. 3) Run it your way: FLUX.1 includes options you can run locally (schnell is Apache 2.0), which matters when Legal asks if images left your environment. FLUX is Black Forest Labs’ text-to-image family. What makes it different in 2025: - Less “close enough”: stronger prompt adherence means fewer redo loops and less prompt wrestling. - Typography you can ship: FLUX.2 [flex] is tuned for small details and text-heavy assets (labels, UI, signage). - Edits that don’t drift: FLUX.2 [max] is positioned for consistent edits and tight prompt following, so “change one thing” is less likely to break three others. - Open-weights path: FLUX.1 tiers let you run locally for cost control and governance. When to use it: 💼 Executives: Setting policy for where creative AI runs and what data is allowed. 💼 Knowledge workers: Drafting campaign concepts, diagrams, and slide visuals fast, with fewer redo loops. 💼 IT professionals and teams: Deploying a self-hosted image pipeline. When not to use it: ⚠️ You need enterprise-safe licensing first: Adobe Firefly. ⚠️ You want zero setup and a slick workflow: Midjourney. ⚠️ You only need a few images occasionally: DALL·E or GPT Image. Text-to-image is an area we’ve spent a lot of time in, from early prototypes through to production pipelines. Lately we’re seeing teams prototype packaging and UI on FLUX locally, so they can iterate fast and keep early drafts off external servers. The gap isn’t knowing FLUX exists; it’s knowing when you actually need a model that’s building a reputation for three things - prompt adherence, usable text, and run-it-your-way deployment - and when open weights, licensing, and governance change the right answer. Following the series? Full guide in January. Follow AI Body of Knowledge for updates. Building internal capability? DM us, we deliver training on model selection, evaluation, and safe use. #12DaysOfAIModels #CapabilityOverHype #AIBoK2025

    • No alternative text description for this image

Similar pages