
What are the differences among greedy, reluctant and possessive quantifiers?

Quantifiers are used to indicate the number of instances of the element (to which they are applied in the regular expression) required for a successful match. Java supports three quantifier types, namely greedy, reluctant, and possessive. Greedy quantifiers try to match as much as possible, while their reluctant counterparts (with ? at the end) try to match the least required to fulfill a match. What this means is that a greedy quantifier will try to match the entire line whether or not a successful match has occurred. It can turn into real performance overhead when the target text is big. Reluctant (or lazy) quantifiers quit as soon as a successful match occurs without bothering to run through the entire line. Possessive quantifiers (with + appended) are useful in optimizing the match operations since they don't keep the prior match states around. (Quote from Simplify Pattern Matching by Anant Athale.)

In practice, a greedy quantifier first consumes as much input as it can and then backtracks, giving back one character at a time until the rest of the pattern matches. A reluctant quantifier starts with as little as possible and expands only as far as needed for the rest of the pattern to match. A possessive quantifier consumes as much as it can and never gives anything back, which saves the cost of backtracking but can make the overall match fail.

For example, if your text is "abcba abcba":
  • The greedy pattern "ab.*ba" will match the substring "abcba abcba" -- the largest substring that fits the pattern

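    import java.util.Scanner;
    import java.util.regex.MatchResult;
    import java.util.regex.Pattern;

    public class Program
    {
     public static void main(String[] args){

       Scanner s = new Scanner("abcba abcba");
       Pattern p = Pattern.compile("ab.*ba");  // greedy .*
       s.findInLine(p);  // returns null if the pattern is not found
       try {
         // match() throws IllegalStateException if the last find failed
         MatchResult result = s.match();

         System.out.println(result.group());
         // print capturing groups, if any (this pattern has none)
         for (int i=1; i<=result.groupCount(); i++)
           System.out.println(result.group(i));
       }
       catch(IllegalStateException e) {
         System.out.println("No match");
       }
       finally {
         s.close();  // close the scanner whether or not a match was found
       }
     }
    }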
    
    The output is "abcba abcba".
  • The reluctant pattern "ab.*?ba" will match the substring "abcba" -- the first substring that fits the pattern

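    import java.util.Scanner;
    import java.util.regex.MatchResult;
    import java.util.regex.Pattern;

    public class Program
    {
     public static void main(String[] args){

       Scanner s = new Scanner("abcba abcba");
       Pattern p = Pattern.compile("ab.*?ba");  // reluctant .*?
       s.findInLine(p);  // returns null if the pattern is not found
       try {
         // match() throws IllegalStateException if the last find failed
         MatchResult result = s.match();
         System.out.println(result.group());
         // print capturing groups, if any (this pattern has none)
         for (int i=1; i<=result.groupCount(); i++)
           System.out.println(result.group(i));
       }
       catch(IllegalStateException e) {
         System.out.println("No match");
       }
       finally {
         s.close();  // close the scanner whether or not a match was found
       }
     }
    }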
    The output is "abcba".
  • The possessive pattern "ab.*+ba" will not match at all, because the possessive .*+ will gobble up all of "cba abcba", including the closing "ba", and never let go of it again.

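    import java.util.Scanner;
    import java.util.regex.MatchResult;
    import java.util.regex.Pattern;

    public class Program
    {
     public static void main(String[] args){

       Scanner s = new Scanner("abcba abcba");
       Pattern p = Pattern.compile("ab.*+ba");  // possessive .*+
       s.findInLine(p);  // returns null here because the pattern is not found
       try {
         // match() throws IllegalStateException if the last find failed
         MatchResult result = s.match();
         System.out.println(result.group());
         // print capturing groups, if any (this pattern has none)
         for (int i=1; i<=result.groupCount(); i++)
           System.out.println(result.group(i));
       }
       catch(IllegalStateException e) {
         System.out.println("No match");
       }
       finally {
         s.close();  // close the scanner whether or not a match was found
       }
     }
    }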
    The output is "No match".
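
The three cases above can also be compared side by side. The following is only a minimal sketch, not part of the original examples: it uses java.util.regex.Matcher instead of Scanner, and the class name QuantifierComparison and the helper firstMatch are hypothetical names introduced for illustration. It also adds a fourth pattern, "ab[^b]*+ba", to suggest that a possessive quantifier can still match when the quantified element cannot consume the closing "ba".

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class QuantifierComparison {

    // Hypothetical helper: returns the first match of regex in text, or null if there is none.
    static String firstMatch(String regex, String text) {
        Matcher m = Pattern.compile(regex).matcher(text);
        return m.find() ? m.group() : null;
    }

    public static void main(String[] args) {
        String text = "abcba abcba";
        String[] patterns = {
            "ab.*ba",      // greedy: consumes everything, then backtracks to the last "ba"
            "ab.*?ba",     // reluctant: expands only until the first "ba"
            "ab.*+ba",     // possessive: .*+ consumes the closing "ba" and never gives it back
            "ab[^b]*+ba"   // possessive, but [^b] stops before the closing "ba"
        };
        for (String regex : patterns) {
            String match = firstMatch(regex, text);
            System.out.println(regex + " -> "
                + (match == null ? "No match" : "\"" + match + "\""));
        }
    }
}
```

Under these assumptions, the first three patterns should reproduce the results shown above ("abcba abcba", "abcba", and no match), while the fourth should match "abcba", because [^b]*+ consumes only the "c" and leaves the closing "ba" for the rest of the pattern.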
