Look-ahead Algorithm Research Articles

Move-based iterative improvement partitioning (IIP) methods, such as the Fiduccia-Mattheyses (FM) algorithm [Fiduccia and Mattheyses 1982] and Krishnamurthy's Look-Ahead (LA) algorithm [Krishnamurthy 1984], are widely used in VLSI CAD applications, largely due to their time efficiency and ease of implementation. Algorithms in this class are of the "local/greedy improvement" type, and they generate relatively high-quality results for small and medium-size circuits. However, as VLSI circuits become larger, these algorithms suffer a rapid deterioration in solution quality. We propose new IIP methods, CLIP and CDIP, that select cells to move with a view to moving clusters that straddle the two subsets of a partition into one of the subsets. The new algorithms significantly improve partition quality while preserving the advantage of time efficiency. Experimental results on 25 medium- to large-size ACM/SIGDA benchmark circuits show up to 70% improvement over FM in mincut, and average mincut improvements of about 35% over all circuits and 47% over large circuits. They also outperform state-of-the-art non-IIP techniques, the quadratic-programming-based method Paraboli [Reiss et al. 1994] and the spectral partitioner MELO [Alpert and Yao 1995], by about 17% and 23%, respectively, with less CPU time. This demonstrates the potential of sophisticated IIP algorithms in dealing with the increasing complexity of emerging VLSI circuits. We also compare CLIP and CDIP to hMetis [Karypis et al. 1997], one of the best of the recent state-of-the-art partitioners that are based on the multilevel paradigm (others include ML_C [Alpert et al. 1997] and LSR/MFFS [Cong et al. 1997]). The results show that one scheme of hMetis is 8% worse than CLIP/CDIP and the other two schemes are only 2-4% better. However, CLIP/CDIP have advantages over hMetis and other multilevel partitioners that outweigh these minimal mincut improvements. The first is much faster times-to-solution (for example, one of our best schemes, CLIP-LA2, is 6.4 and 11.75 times faster than the two best hMetis schemes) and much better scalability with circuit size (e.g., for the largest circuit, with about 162K nodes, CLIP-LA2 is 10.4 and 21.5 times faster and obtains better solution quality than the two best hMetis schemes). Second, CLIP/CDIP are "flat" partitioners, while multilevel techniques perform a sequence of node clustering/coarsening steps before partitioning the circuit. In complex placement applications such as timing-driven placement in the presence of multiple constraints, such circuit coarsening can hide crucial information needed for good-quality solutions, thus making the partitioning process oblivious to it. This, however, is not a problem with flat partitioners like CLIP/CDIP, which can take all important parameters into account while partitioning. All these advantages make CLIP/CDIP suitable for use in complex physical design problems for large, deep-submicron VLSI circuits.
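As background for the move-based methods mentioned above, the following is a minimal sketch of the basic quantity an FM-style pass ranks cells by: the reduction in cut size obtained by moving a single cell to the other subset. It is not the CLIP or CDIP selection rule, which the abstract does not spell out. The function names, the toy netlist, and the omission of balance constraints and gain buckets are all simplifying assumptions made for illustration.

```python
from typing import Dict, Hashable, Sequence

Cell = Hashable


def cut_size(nets: Sequence[Sequence[Cell]], side: Dict[Cell, int]) -> int:
    """Count nets that have cells in both subsets (0 and 1) of the partition."""
    return sum(1 for net in nets if len({side[c] for c in net}) == 2)


def fm_gain(cell: Cell, nets: Sequence[Sequence[Cell]], side: Dict[Cell, int]) -> int:
    """Reduction in cut size if `cell` alone moves to the other subset."""
    gain = 0
    for net in nets:
        if cell not in net:
            continue
        same = sum(1 for c in net if side[c] == side[cell])  # includes `cell`
        other = len(net) - same
        if same == 1 and other > 0:
            gain += 1  # cell is alone on its side: moving it uncuts the net
        elif other == 0 and len(net) > 1:
            gain -= 1  # net lies entirely on cell's side: moving it cuts the net
    return gain


# Hypothetical four-cell circuit, split as {a, b} | {c, d}.
nets = [("a", "b"), ("b", "c"), ("c", "d"), ("a", "c")]
side = {"a": 0, "b": 0, "c": 1, "d": 1}

print(cut_size(nets, side))      # 2: nets (b, c) and (a, c) are cut
print(fm_gain("c", nets, side))  # 1: moving c uncuts two nets and cuts (c, d)
```

A full FM pass would repeatedly move the highest-gain unlocked cell subject to a balance constraint, lock it, update neighboring gains, and finally keep the best prefix of moves; those steps are omitted here.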

A probability-based partitioning algorithm, PROP, was introduced in [8] that achieved large improvements over traditional "deterministic" iterative-improvement techniques like Fiduccia-Mattheyses (FM) and Krishnamurthy's look-ahead (LA) algorithm. While PROP's gain function has a greater futuristic component than FM or LA, it incorporates only spatially local information: only information on the removal probabilities of adjacent nets of a cell is used in its gain computation. This prevents a higher-level view of nonlocal structures. Also, giving uniform weights to all nets results in an inability to differentiate between the futuristic benefits of removing one net versus another. This paper investigates for the first time the issues of using nonlocal structural information in gain functions and variable net weights based on the futuristic (stochastic) benefit of moving them from the cutset. The result is a more sophisticated partitioner, DEEP-PROP, that performs better for circuits with large complexities by incorporating more nonlocal (second-order) structural information than PROP. The second-order information is incorporated into cell gains as well as variable net weights; the latter help to focus future cell moves in the "right" cluster around the currently moved cell and, thus, better utilize the information that led to its selection as the best move. A lower-complexity version, variable weight PROP (VAR-PROP), that also uses dynamically assigned variable net weights, but based on first-order information, has also been developed. Both versions yield significant improvements over PROP on the ACM/SIGDA benchmark suite. DEEP-PROP yields mincut improvements of as much as 39% and an average of 20% for large circuits (10K to 25K cells) and an average of 14% over all circuits. DEEP-PROP is about 2.8 times slower than PROP, which is itself very fast. VAR-PROP, which has a much lower computational complexity than DEEP-PROP, yields for large circuits maximum and average mincut improvements over PROP of 27% and 18%, respectively, and an average improvement of 12.6% over all circuits. It is only about 14% slower than PROP. For golem3, the only very large circuit in the suite (>100K cells), the improvements produced by DEEP-PROP and VAR-PROP over PROP are 15.6% and 11.5%, respectively. We also compare DEEP-PROP to FM, PROP and hMetis for a subset of the newer ISPD98 benchmark circuits, and demonstrate significant improvements over FM and PROP, and comparable mincuts (within 2%) to hMetis, one of the best multilevel partitioners.
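The point about uniform net weights can be illustrated with a small sketch. This is not the PROP, VAR-PROP, or DEEP-PROP gain function, which are probabilistic and are not specified in the abstract. It is a plain deterministic FM-style gain with a per-net weight added, and the function name, netlist, and weight values are hypothetical. It only shows that with uniform weights two candidate moves can tie, whereas non-uniform weights separate them.

```python
from typing import Dict, Hashable, Sequence

Cell = Hashable


def weighted_gain(
    cell: Cell,
    nets: Sequence[Sequence[Cell]],
    weights: Sequence[float],
    side: Dict[Cell, int],
) -> float:
    """Weighted reduction in cut cost if `cell` alone moves to the other subset."""
    gain = 0.0
    for net, weight in zip(nets, weights):
        if cell not in net:
            continue
        same = sum(1 for c in net if side[c] == side[cell])  # includes `cell`
        other = len(net) - same
        if same == 1 and other > 0:
            gain += weight  # moving the cell removes this net from the cutset
        elif other == 0 and len(net) > 1:
            gain -= weight  # moving the cell adds this net to the cutset
    return gain


# Two cut nets; cells a and b are each the lone cell of one net on side 0.
nets = [("a", "x"), ("b", "y")]
side = {"a": 0, "b": 0, "x": 1, "y": 1}

uniform = [1.0, 1.0]
variable = [1.0, 3.0]  # hypothetical weights favoring the second net

print(weighted_gain("a", nets, uniform, side), weighted_gain("b", nets, uniform, side))
print(weighted_gain("a", nets, variable, side), weighted_gain("b", nets, variable, side))
```

With the uniform weights the two moves are indistinguishable; with the variable weights the move of `b` ranks higher, which is the kind of differentiation the abstract attributes to variable net weights.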

Articles published on Look-ahead Algorithm

Selective Sampling for Nearest Neighbor Classifiers

The effects of look-ahead algorithms in content functional robotic intelligence

Cluster-aware iterative improvement techniques for partitioning large VLSI circuits

Partitioning using second-order information and stochastic-gain functions

Heuristics for Large Constrained Vehicle Routing Problems

No more “Partial” and “Full Looking Ahead”

Further optimized look-ahead recurrences for adjacent rows in the Padé table and Toeplitz matrix factorizations

A lookahead algorithm for the solution of block Toeplitz systems

On Ramsey number R(4, 3, 3) and triangle-free edge-chromatic graphs in three colors

Improved clustered look-ahead pipelining algorithm with minimum order augmentation

Optimized look-ahead recurrences for adjacent rows in the Padé table

Computation of Numerical Padé–Hermite and Simultaneous Padé Systems II: A Weakly Stable Algorithm

Formal Development of a Task-Oriented Look-Ahead Storage Management Scheme

Why do we need numerical methods for constrained fermion systems?

A look-ahead heuristic for scheduling jobs with release dates on a single machine

An Implementation of the QMR Method Based on Coupled Two-Term Recurrences

A look-ahead algorithm for the solution of general Hankel systems

A look-ahead Levinson algorithm for general Toeplitz systems

Algorithm transformation techniques for concurrent processors

'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers.

