Skip to main content

Efficient way to count characters or find duplicate characters in String

There are multiple ways to count the characters or find the duplicate characters in a String. Here we will see the two approaches for the same which may be useful in same or different scenarios.

Count or find duplicate characters using Hash Map

With hash map we can keep the character count in the map against given character which is used as map key. Map uses hashcode to store the keys. To learn more on map storage, you can refer the link Why hashmap keys should be immutable.
    public void countCharUsingMap(String str){
        Map<Character, Integer> countMap = new HashMap<>();
        for(Character ch:str.toCharArray()){
            int count = countMap.get(ch)==null?1:countMap.get(ch)+1;
            countMap.put(ch, count);
        }
        countMap.forEach((k,v)->{
            System.out.println("'"+k+"' : "+v+(v>1?"[DUP]":""));
        });
    }
This code has stored the count for each character in Map, which we will use to iterate through map entries to print the key, value which is character and it's count in given string.
Above code executed with string "This is a test for character count!" and below is the output.
' ' : 6[DUP]
'a' : 3[DUP]
'!' : 1
'c' : 3[DUP]
'e' : 2[DUP]
'f' : 1
'h' : 2[DUP]
'i' : 2[DUP]
'n' : 1
'o' : 2[DUP]
'r' : 3[DUP]
's' : 3[DUP]
'T' : 1
't' : 4[DUP]
'u' : 1

Count or find duplicate characters using ASCII numbers

If we are having only ascii characters in our string, it is very easy and comparatively faster to count the characters using ascii number. Please note that this method will not work with unicode characters. Here we are using ascii number of each character to store the count in fixed array of int. As we know that there are total 256 characters in ascii, so we need to create and array of size 256 to store the count. In java each character already represents it's ascii number if we typecast it to int and that way we can store the count at particular index in array which is the ascii number of character.
    public void countUsingAsciiCode(String str){
        int[] countArr = new int[256];

        for(char ch:str.toCharArray()){
            int count = 1;
            int index = (int)ch;
            countArr[index]++;
        }

        for(int idx=0;idx<countArr.length;idx++){
            int count = countArr[idx];
            if(count>0){
                System.out.println("'"+((char)idx)+"' : "+count+(count>1?"[DUP]":""));
            }
        }
    }
This code has stored the count of each character in given int array which is used latter to print the count and duplicate for same. Since it is primitive array, it will have 0 as default value so we will skip the array elements with 0 value.
Above code executed with string "This is a test for character count!" and below is the output.
' ' : 6[DUP]
'!' : 1
'T' : 1
'a' : 3[DUP]
'c' : 3[DUP]
'e' : 2[DUP]
'f' : 1
'h' : 2[DUP]
'i' : 2[DUP]
'n' : 1
'o' : 2[DUP]
'r' : 3[DUP]
's' : 3[DUP]
't' : 4[DUP]
'u' : 1

Comments

Post a Comment

Popular Posts

Setting up kerberos in Mac OS X

Kerberos in MAC OS X Kerberos authentication allows the computers in same domain network to authenticate certain services with prompting the user for credentials. MAC OS X comes with Heimdal Kerberos which is an alternate implementation of the kerberos and uses LDAP as identity management database. Here we are going to learn how to setup a kerberos on MAC OS X which we will configure latter in our application. Installing Kerberos In MAC we can use Homebrew for installing any software package. Homebrew makes it very easy to install the kerberos by just executing a simple command as given below. brew install krb5 Once installation is complete, we need to set the below export commands in user's profile which will make the kerberos utility commands and compiler available to execute from anywhere. Open user's bash profile: vi ~/.bash_profile Add below lines: export PATH=/usr/local/opt/krb5/bin:$PATH export PATH=/usr/local/opt/krb5/sbin:$PATH export LDFLAGS=...

Why HashMap key should be immutable in java

HashMap is used to store the data in key, value pair where key is unique and value can be store or retrieve using the key. Any class can be a candidate for the map key if it follows below rules. 1. Overrides hashcode() and equals() method.   Map stores the data using hashcode() and equals() method from key. To store a value against a given key, map first calls key's hashcode() and then uses it to calculate the index position in backed array by applying some hashing function. For each index position it has a bucket which is a LinkedList and changed to Node from java 8. Then it will iterate through all the element and will check the equality with key by calling it's equals() method if a match is found, it will update the value with the new value otherwise it will add the new entry with given key and value. In the same way it check for the existing key when get() is called. If it finds a match for given key in the bucket with given hashcode(), it will return the value other...

Entity to DTO conversion in Java using Jackson

It's very common to have the DTO class for a given entity in any application. When persisting data, we use entity objects and when we need to provide the data to end user/application we use DTO class. Due to this we may need to have similar properties on DTO class as we have in our Entity class and to share the data we populate DTO objects using entity objects. To do this we may need to call getter on entity and then setter on DTO for the same data which increases number of code line. Also if number of DTOs are high then we need to write lot of code to just get and set the values or vice-versa. To overcome this problem we are going to use Jackson API and will see how to do it with minimal code only. Maven dependency <dependency> <groupId>com.fasterxml.jackson.core</groupId> <artifactId>jackson-databind</artifactId> <version>2.9.9</version> </dependency> Entity class Below is ...

Multiple data source with Spring boot, batch and cloud task

Here we will see how we can configure different datasource for application and batch. By default, Spring batch stores the job details and execution details in database. If separate data source is not configured for spring batch then it will use the available data source in your application if configured and create batch related tables there. Which may be the unwanted burden on application database and we would like to configure separate database for spring batch. To overcome this situation we will configure the different datasource for spring batch using in-memory database, since we don't want to store batch job details permanently. Other thing is the configuration of  spring cloud task in case of multiple datasource and it must point to the same data source which is pointed by spring batch. In below sections, we will se how to configure application, batch and cloud task related data sources. Application Data Source Define the data source in application properties or yml con...