序列的有序笛卡尔积

2023年1月18日 367次阅读

在
efficient sorted Cartesian product of 2 sorted array of integers中,建议使用惰性算法为两个排序的整数数组生成有序的笛卡尔积.

我很想知道这个算法是否有更多数组的概括.

例如,假设我们有5个排序的双精度数组

(0.7,0.2,0.1)

(0.6,0.3,0.1)

(0.5,0.25,0.25)

(0.4,0.35,0.25)

(0.35,0.35,0.3)

我有兴趣生成订购的笛卡尔积,而无需计算所有可能的组合.

欣赏有关可能的懒惰笛卡尔积算法如何扩展到超过2的维度的任何想法.

最佳答案此问题似乎是统一成本搜索的枚举实例(参见例如
https://en.wikipedia.org/wiki/Dijkstra%27s_algorithm).您的状态空间由指向已排序数组的当前索引集定义.后继函数是每个数组的可能索引增量的枚举.对于5个数组的给定示例,初始状态为(0,0,0,0,0).

没有目标状态检查功能,因为我们需要经历所有可能性.如果对所有输入数组进行排序,则保证对结果进行排序.

假设我们有m个长度为n的数组,那么这个方法的复杂性是O((n ^ m).log(n(m-1)).

这是python中的示例实现：

from heapq import heappush, heappop

def cost(s, lists):
    prod = 1
    for ith, x in zip(s, lists):
        prod *= x[ith]
    return prod

def successor(s, lists):
    successors = []
    for k, (i, x) in enumerate(zip(s, lists)):
        if i < len(x) - 1: 
            t = list(s)
            t[k] += 1
            successors.append(tuple(t))
    return successors

def sorted_product(initial_state, lists):    
    fringe = []
    explored = set()
    heappush(fringe, (-cost(initial_state, lists), initial_state))
    while fringe:
        best = heappop(fringe)[1]
        yield best
        for s in successor(best, lists):
            if s not in explored:
                heappush(fringe, (-cost(s, lists), s))
                explored.add(s)

if __name__ == '__main__':
    lists = ((0.7, 0.2, 0.1),
             (0.6, 0.3, 0.1),
             (0.5, 0.25, 0.25),
             (0.4, 0.35, 0.25),
             (0.35, 0.35, 0.3))
    init_state = tuple([0]*len(lists))
    for s in sorted_product(init_state, lists):
        s_output = [x[i] for i, x in zip(s, lists)]
        v = cost(s, lists)
        print '%s %s \t%s' % (s, s_output, cost(s, lists))