DevelopsenseLogo

Testing ChatGPT and Data Generation

December 11, 2023; ChatGPT 3.5 (chat.openai.com) I believe that it’s unwise and irresponsible to promote Large Lanaguage Models for any testing purpose unless we have tested and assessed how well the LLM does on that task. We should not simply accept the claims and fantasies of AI fanboys; nor should we accept the idea that because Everything Will Be Fine in the Future, we should start applying LLMs indiscriminately today. … Read more

ChatGPT and Quick Intelligence Tests (II)

Here’s the prompt: “Create five sentences composed of five words; then for each sentence, leave out the first four words, and print the result.” And here’s the output: (Results from ChatGPT 3.5 (chat.openai.com), December 7, 2023. A colleague reports better results from ChatGPT 4.0.) It seems to me that most humans would offer something more like …either with or without the periods. Note ChatGPT’s misinterpretation of the assignment. It seems … Read more

ChatGPT and Quick Intelligence Tests

A few nights ago, a colleague noted that he was impressed by ChatGPT’s capacity to evaluate a sentence. He had offered a prompt something like “Is honesty the best virtue” without the question mark, and was surprised that ChatGPT could infer that he had intended a question. (In this post, I will use cLLMs to refer to chatbots based on LLMs.) I was less surprised, since cLLMs reply by design. … Read more

To the Developer: About Your Impending “Promotion”

Dear Developer… I’d like to warn you that you’re about to get “promoted” — and doubtless without a pay increase — to a kind of management job that I bet you don’t want. Hold on a second; pardon me. Let me introduce myself first. I’m a tester. As a tester, it’s my job to point out problems and risks that you might not have recognized. A brief, informal survey of … Read more

ChatGPT and Test Data

People are touting ChatGPT as a tool for generating test data. Let’s try something simple: MB: You are a powerful, diligent assistant to a professional software tester. Give me a table of 30 numbers. In the first column, provide the number. In the second column, provide the English spelling of the number. Sort the column in alphabetical order by the values in the second column. ChatGPT 3.5: Certainly! Here’s a … Read more

Bing Chat, the Evaluate Function, and the Wolfram Alpha Plugin

When you read or even scan this post, you’re likely to say something like “Holy hopscotch, that’s a long post.”  And you’ll be right. And you might be inclined to say “…and it’s boring.” And depending on your perspective, you’ll be right about that, too. It certainly has taken a significant amount of time to edit and to narrate. If you’re interested in risk associated with Large Language Models and … Read more

Out of Order

To testers (and to designers, developers, and managers, and others who might do testing), this is your periodic reminder to do things out of order. By “do things out of order”, I mean: perform sequences of actions that do not follow the designer or the developer’s imagination of how things should happen; how things normally happen; what the customers might want; or what’s easiest to process. Here’s an example from … Read more

Tacit Knowledge Transfer Makes the News

In the Guardian, October 22 2023, John Naughton writes an article on the complications associated with moving semiconductor manufacturing from Taiwan to the United States. The article and the problems that it describes are pretty interesting. At the centre of the product is the difficulty of transferring tacit knowledge about chip fabrication. It turns out that documentation on its own doesn’t work very well. What matters most is having people … Read more

CDO

In the Rapid Software Testing Managed class today, James Bach was talking about experiences with CTOs. It struck me that it might be fun to come up with an expansion of the abbreviation that would represent the wishful thinking that James was talking about; like “Chief Wishing Officer”, but with a word for “wish” that starts with T. Aha! I thought. Here’s a great application of ChatGPT — get it … Read more

When the Developers Are the Users

This is a lightly-edited version of a repost on LinkedIn. The original post contained a photo of a conference talk. The presenter was a dude in a Spiderman costume. (I’ve always wondered how many Spiderman costumes we’d see at meetings of doctors, or journalists, or theoretical physicists. But I digress.) The screen displayed a slide “Everyone cares about User Experience, but no one cares about Developer Experience.” Spiderman outfit notwithstanding, … Read more