Network slicing in 5G and beyond networks allows the network to be customized for each application or service, by chaining together different virtualized network functions (VNFs) according to service requirements. The increased flexibility offered by network slicing comes at the cost of complexity in management and orchestration, which cannot be solved by traditional reactive human-in-the-loop solutions. This necessitates minimizing human intervention through the use of artificial intelligence (AI) techniques (zero-touch network management). In particular, the scaling and placement of the chain of VNFs which comprise a network slice is a complex combinatorial optimization problem which is difficult to solve effectively with traditional approaches. Driven by the benefits of deep reinforcement learning (DRL) in solving various combinatorial optimization problems, in this article, we survey various DRL-based approaches to slice scaling and placement, including different ways to model the problem and benefits of various DRL techniques in addressing specific aspects of the problem. Further, we highlight key challenges and open issues in the effective use of DRL for network slice scaling and placement.