Test Engineering Notes

🤖Тестування ШІ vs Тестування з ШІ

#testing #ai

Jeff Nyman випустив цикл статей AI and Testing для тих, кому цікаво саме тестувати ШІ моделі. А не просто користуватись інструментами. Пости великі та потребують часу для опрацювання, але це СКАРБ та MUST-READ. Статті дають доволі непогані знання про те, як працюють моделі та найголовніше - як їх тестувати.

📝Статті:

• Ollama and Models
• Local LLMs and LangChain
• LangChain Templates
• LangChain Messages
• LangChain and Orchestration
• A Testing Example
• Refining Tests
• Refactoring Tests
• Scaling Tests

❗️Крім того, Jeff поділився своїми думками про навички тестувальників - "AI and Testing: Personal Marketability" Цю статтю я теж дуже раджу почитати.

💡 Ділюся основними моментами:

• Коли у вакансії бачите "потрібен досвід з ШІ" то, зазвичай, це вміння користуватись інструментами ШІ для тестування (автоматизації). Дуже рідко зе значить саме тестування ШІ.
• Курси зараз в основному вчать першій категорії навичок. Але вивчаючи як тестувати ШІ ви так чи інакше вчитесь працювати з ШІ інструментами ефективніше.
• Важливі обидві категорії навичок.

⭐️ Що значить хороший тест кейс для ШІ?

• Тестування узгодженості в умовах суперечливої інформації
• Тестування задовільного рівня невизначеності
• Перевірка на дисперсія при однакових умовах
• Перевірка межі між знанням та висновками (міркуваннями)

🐙 Й головне:

There’s understandable anxiety in the testing community about AI replacing testers. (Just as there is for developers.) But here’s what that concern misses: the skills that make you good at testing are exactly the skills that make you valuable in an AI-augmented world. That’s the case whether you’re using AI to assist your testing or testing AI systems themselves.

Quality and test specialists have always needed an eye for spotting ambiguity, inconsistency, and contradiction. You look at a requirements document and notice where two statements can’t both be true. You examine a user interface and spot where the behavior contradicts the stated intent. You read test results and detect where the data doesn’t align with expectations. This isn’t a skill AI replaces. It’s a skill AI desperately needs applied to it.

🔥33⚡2

3.6K viewsedited 10:27