A gene string can be represented by an 8-character long string, with choices from ‘A’, ‘C’, ‘G’, and ‘T’.
Suppose we need to investigate a mutation from a gene string startGene to a gene string endGene where one mutation is defined as one single character changed in the gene string.
There is also a gene bank bank that records all the valid gene mutations. A gene must be in bank to make it a valid gene string.
Given the two gene strings startGene and endGene and the gene bank bank, return the minimum number of mutations needed to mutate from startGene to endGene. If there is no such a mutation, return -1.
Note that the starting point is assumed to be valid, so it might not be included in the bank.
Input: startGene = “AACCGGTT”, endGene = “AACCGGTA”, bank = [“AACCGGTA”]
Output: 1
Input: startGene = “AACCGGTT”, endGene = “AAACGGTA”, bank = [“AACCGGTA”,“AACCGCTA”,“AAACGGTA”]
Output: 2
From: LeetCode
Link: 433. Minimum Genetic Mutation
1. Initialization
2. Breadth-First Search (BFS)
int minMutation(char * startGene, char * endGene, char ** bank, int bankSize) {
if (bankSize == 0) return -1;
int *visited = (int *)calloc(bankSize, sizeof(int));
char **queue = (char **)malloc((bankSize + 1) * sizeof(char *));
int front = 0, rear = 0;
queue[rear++] = startGene;
int level = 0;
while (front < rear) {
int size = rear - front;
for (int i = 0; i < size; ++i) {
char *gene = queue[front++];
if (strcmp(gene, endGene) == 0) {
free(visited);
free(queue);
return level;
}
for (int j = 0; j < bankSize; ++j) {
if (visited[j]) continue;
int diff = 0;
for (int k = 0; k < 8; ++k) {
if (gene[k] != bank[j][k]) ++diff;
if (diff > 1) break;
}
if (diff == 1) {
visited[j] = 1;
queue[rear++] = bank[j];
}
}
}
++level;
}
free(visited);
free(queue);
return -1;
}