computing star: algorithm

顯示具有 algorithm 標籤的文章。顯示所有文章

2009年2月7日

Range Minimum(Maximum) Query, RMQ

Input：array A[0, N]
Output： the index of minimum(maximum) value between two given indices.

RMQ的功能是若有一群資料放在array中，在經過preprocessing後，可在O(1)的時間內找出index i~j之間的最小(大)的元素。目前preprocessing最快是O(n)，也就是只需要O(n)的時間就可以處理完成。

以下介紹O(nlog(n))的preprocessing方法，O(n)的方法是由此法去改進，所以先了解此法相當重要且因程式碼容易撰寫，所以也相當適合在ACM中使用。

假設array的長度2的n次方。
建立一個大小為n x log(n)的矩陣M，用來存放RMQ的結果，其中M[i][j]是指以index i開頭，長度為2^j次方內最小元素的index。
使用dynamic programming方法建立M，需O(nlogn) time。

//time complexity: O(nlogn)
void preprocess(int M[MAXN][LOGMAXN], int A[MAXN], int N)
{
int i, j;
//initialize M for the intervals with length 1
for (i = 0; i < N; i++)
M[i][0] = i;
//compute values from smaller to bigger intervals
for (j = 1; 1 << j <= N; j++)
for (i = 0; i + (1 << j) - 1 < N; i++)
if (A[M[i][j - 1]] < A[M[i + (1 << (j - 1))][j - 1]])
M[i][j] = M[i][j - 1];
else
M[i][j] = M[i + (1 << (j - 1))][j - 1];
}

//output: the index of the minimum value between index i and j
int RMQ(int i, int j, int M[MAXN][LOGMAXN], int A[MAXN], int N)
{
if(i<0 || i>=N || j<0 || j>=N)
return -1;
if(i > j) //swap i, j
i ^= j, j ^= i, i ^= j;
int k = (int)(log(j-i)/log(2.0)),
rem = j-(1<<k) + 1;
return (A[M[i][k]] > A[M[rem][k]] ? M[rem][k] : M[i][k]);
}

Reference：

Range Minimum Query and Lowest Common Ancestor

One dimension minimum Hausdorff distance

Hausdorff distance可以用來量測兩個sets之間的距離，如果將其中一個set的元素全部加(減)上t個單位，則兩個sets之間的Hausdorff distance可能會變小或變大。實際的應用，假設我們要做影像的比對，有一張影像為template，而另一張影像是準備要和template比對的圖片，若能夠讓兩張圖片儘量對齊，則能夠提高辨識的淮確度。

my meeting report

我在IPL Vol 106(2008)第一次看到這一篇A new algorithm for computing the minimum Hausdorff distance between two point sets on a line under translation時，看了很久還是不懂演算法實際上是如何運作的，於是追蹤下去追到了源頭的兩篇論文。

源頭是Huttenlocher在1990年發表的Computing the Mimimum Hausdorff Distance for Point Sets Under Translation，這一篇提出的算法提供最初的想法，把sets中每一個元素移動t單位的軌跡的圖形找出來，再找出所有軌跡極大值中的最小值即為所求，時間複雜度是O(mnlog(mn))。

第二篇論文是Rote在1991年發表的Computing the mimimum Hausdorff distance between two point sets on a line under translation也是沿用了Huttenlocker的概念，只是他使用了一個lower bound來減少搜尋解答時所要檢查的解答數量，這個演算法的時間複雜度已經是Optimal (O(m+n)log(m+n))，所以一直以來大家都使用這個算法。

2008年的這一篇論文也是Optimal algorithm，但是所需要檢查的解答數量又更少了，時間複雜度仍然是O((m+n)log(m+n))，但實驗的結果比Rote’s algorithm快了將近15倍。

以下是我用來畫軌跡圖的Scilab script：

//one dimensional Hausdorff distance
clear;
function [dist] = hausdorff(A, B)
if(size(A, 'r') ~= 1 | size(B, 'r') ~= 1)
warning("must be one dimension array");
dist = [];
return;
end
dist = max(compute_dist(A, B), compute_dist(B, A));
endfunction
//compute distance from point to set
function [dist] = compute_dist(A, B)
m = size(A, 'c');
n = size(B, 'c');
for k=1:m
D = abs(B - A(k));
dist(k) = min(D);
end
dist = max(dist);
endfunction
function [dist,dist2] = minHausdorff(A, B, t)
if (size(A, 'r') ~=1 | size(B, 'r') ~= 1 | size(t, 'r') ~=1)
warning("must be one dimension array");
dist=[];
return;
end
m = size(A,'c');
n = size(B,'c');
len = size(t,'c');
for i=1:m
for j=1:len
dist(i, j) = compute_dist(A(i)+t(j), B);
end
end
subplot(1, 2, 1);
xlabel("t");
plot(t, dist);
for i=1:n
for j=1:len
dist2(i, j) = compute_dist(B(i), A+t(j));
end
end
subplot(1, 2, 2);
xlabel("t");
plot(t, dist2);
endfunction
A=[0, 0.5, 2, 3];
B=[0 0.3 1];
t=linspace(-3, 3, 500);
minHausdorff(A, B, t);

2008年9月17日

Coin Change

在寫ACM的時候，碰到了兩題(Q674, Q357)是關於這類型的題目，題目的大意如下：
假設我們有一些硬幣的幣值是1元，5元，10元，25元，50元，而我手中有n元，請問n可為幾種硬幣的組成方式。

Ex：n=11
11個1元硬幣=1個1元+2個5元硬幣=6個1元+1個5 元硬幣=1個1元+1個10元硬幣。
所以有4種組成的方式。
輸入值為隨意的n(0<=n<=max)，輸出為組成n的方式。我解這一題的方式是用DP：令coin[i]為硬幣的幣值，way[i]為i元時，有way[i]種組成的方式，

for (i=0; i<coin.size(); i++)
{
for(j=0;j<way.size();j++)
way[coin[i]+j] += way[j];
}

一開始的時候先計算用1元硬幣時有幾種組成的方式，然後再計算用5元硬幣時有幾種組成方式，以此類推，計算完畢後就是所求的解答。